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Chapter 1 

Introduction to the Circle Method 


The Circle Method is a beautiful idea for investigating many problems in additive number theory. It 
originated in investigations by Hardy and Ramanujan ([HR], 1918) on the partition function P(n). We 
start our study of the Circle Method in §1.1 by reviewing the basic properties of P(n) via generating 
functions, and then exploring generating functions of a variety of problems. In §1.2 we state the main 
ideas of the Circle Method, and then in §1.3 we sketch its applications to writing numbers as the sums of 
primes. We then perform the detailed analysis, handling most of the technicalities, for Germain primes in 
Chapter 2. 

Our goal is to describe the key features of the Circle Method without handling all of the technical 
complications that arise in its use; we refer the reader to the excellent books [EE, Na] for complete 
details. We highlight the main ideas and needed ingredients for its application, and describe the types of 
problems it either solves or predicts the answer. 


1.1 Origins 


In this section we study various problems of additive number theory that motivated the development of 
the Circle Method. For example, consider the problem of writing n as a sum of s perfect k -powers. If 
k — 1, we have seen a combinatorial solution (see §?? and Lemma ??): the number of ways of writing n 
as a sum of s non-negative integers is ( n+ *“ 1 ). Unfortunately, this argument does not generalize to higher 
k (it is easy to partition a set into s subsets; it is not clear how to partition it into s subsets where the 
number of elements in each subset is a perfect square). There is another method, an analytical approach, 
which solves the k = 1 case and can be generalized. 

For \x\ < 1, define the generating function 


OO 

t(x) = y>"> 

m =0 


l 

1 — X 


( 1 . 1 ) 
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Let r 1>s (n) denote the number of solutions to mi H-1- m s = n where each m, is a non-negative integer. 

We claim 

( oo \ / oo \ oo 

E xmi • ■ ■ E xms ) = E («)*"• (1.2) 

This follows by expanding the product in (1.1). We have terms such as x mi ■ ■ ■ x ms , which is x mi+ '" +ms = 
x n for some n. Assuming everything converges, when we expand the product we obtain x n many times, 
once for each choice of mi,..., m s that adds to n. Thus the coefficient of x n in the expansion is f ]s {m). 
On the other hand, we have 


m s = 



1 d s ~ l 1 
(s — 1)! dx s_1 1 — x' 


Substituting the geometric series expansion for gives 


f(x) s 


1 d 3 - 1 

(s — 1)! dx s ~ l 


OO 


E x 

n =0 



(1.3) 


(1.4) 


which yields r\ iS {n) = '). It is this second method of proof that we generalize. Below we describe 

a variety of problems and show how to find their generating functions. In most cases, exact formulas such 
as (1.3) are unavailable; we develop sufficient machinery to analyze the generating functions in a more 
general setting. 


Exercise 1.1.1. Justify the arguments above. Show all series converge, and prove (1.3) and (1.4). 


1.1.1 Partitions 

We describe several problems where we can identify the generating functions. For n e N, Pin) is the 
partition function, the number of ways of writing n as a sum of positive integers where we do not 
distinguish re-orderings. For example, if n — 4 then 

4 = 4 

= 3 + 1 
= 2 + 2 
= 2 + 1+1 

= 1 + 1 + 1 +1, (1.5) 

and P(4) = 5. Note we do not count both 3 + 1 and 1 + 3. If we add the requirement that no two parts 
can be equal, there are only two ways to partition 4: 4 and 3 + 1. 
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Proposition 1.1.2 (Euler). We have as an identity of formal power series 


F(x) 


1 

(1 — x)(l — x 2 )(l — x 3 ) ■ ■ ■ 


1 + ^ P(n)x n . 

n=l 


(1.6) 


An identity of formal power series means that, without worrying about convergence, the two sides 
have the same coefficients of x n for all n. For this example, if we use the geometric series expansion 
on each , 1 , and then collect terms with the same power of x, we would have the series on the right; 
however, we do not know that the series on the right is finite for any x. 


Exercise 1.1.3. Prove the above proposition. Do the product or series converge for any x > 0? Hint: the 
combinatorial bounds from §22 might be a useful starting point. 

F{x) is called the generating function of the partition function. If f(n) is an arithmetic function (see 
Chapter ??), we can associate a generating function to / through a power series: 


F f (x) = 1 + ^/(«)x n (1-7) 

n= 1 


Exercise 1.1.4. 1. Fix m G N. For each n, let p m (n) be the number of partitions of n into numbers 

less than or equal to the given number m. Show that 


(1 — a;)(l — x 2 ) ■■■(! — x m ) = 1 + 

Does the series converge for any x > 0? 

2. Show that 

OO 

(1 + x)(l + x 2 )(l + x 3 ) ■ ■ ■ = 1 + q(n)x n . 

n =1 


( 1 - 8 ) 


(1.9) 


where q{n) is the number of partitions ofn into non-equal parts. Does this series converge for any 

x > 0? 


3. Give similar interpretations for 


1 

(1 — x)(l — x 3 )(l — x 5 ) ■ ■ ■ 

and 

(1 + X 2 ){1 + x A )(l + x 6 ) ■ ■ ■ . 
Do these products converge for any x > 0? 


( 1 . 10 ) 

(l.H) 
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One can use generating functions to obtain interesting properties of the partition functions: 

Proposition 1.1.5. Let n G N. The number of partitions ofn into unequal parts is equal to the number of 
partitions of n into odd numbers. 


Exercise 1.1.6. Prove Proposition 1.1.5. Hint: 

1 _ i _Ji _ ™6 

(1 + *)(1 + * 2 )(1 + X 3 ) • • • = —3 • • • (1.12) 

1 — £ 1 — or 1 — X 6 


(1 — a;)(l — x 3 )(l — x 5 ) • • •' v ' ' 

For more examples of this nature, see Chapter XIX of [HW]. 

So far, we have studied power series expansions where the coefficients are related to the function we 
want to study. We now consider more quantitative questions. Is there a simple formula for P{n)1 How 
rapidly does P(n) grow as n —» oo? Using the Circle Method, Hardy and Ramanujan showed that 


P(n) ~ 


e vy/2n/Z 

4 ns/3 


We prove similar results for other additive problems. 


(U4) 


1.1.2 Waring’s Problem 

It is useful to think of the partition problem in §1.1.1 as the study of the number of ways that a given 
number n can be written as a sum 

y>* (us) 

i 

for k — 1, with the number of terms ranging from n (when each n % = 1) to 1 (when n \ = n). See also 
§??. We can now formulate the following question: 

Question 1.1.7. Let k e N. Let Pifn) be the number of ways that n can be written as the sum of perfect 
k th powers. Can one calculate Pi, (n) ? 

It is clear that for all n, P n (n) is non-zero as n can be written as the sum of n ones. There is a 
striking difference between this case and the problem of P(n) in §1.1.1. The difference is that if n is a 
natural number and m < n, then one can easily write a partition of n into m numbers. For higher powers, 
however, this is false; in fact not true even for k = 2. For example, 3 cannot be written as the sum of two 
squares. Hence we ask the following questions: 
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Question 1.1.8 (Waring’s Problem). Let k E N. What is the smallest number s such that every natural 
number can be written as the sum of at most s perfect k th powers? Does such an s exist? If s exists, how 
does s depend on k ? 

These questions can easily be translated to questions involving appropriate generating functions, as 
we now explain. For Question 1.1.7, we easily see that 


1 + ^P k (n)x n 

n= 1 


l 

(1 — aT fc )(l — x 2k )(l — x 3k ) • • • ’ 


(1.16) 


however, this expansion is only useful if we can use it to calculate the P k (n)s. For Question 1.1.8, consider 
the auxiliary function 

OO 

Qk(x) = 5 > nfe - (1.17) 

n =0 

As an identity of formal power series, we have 


Qk(x) s = 1 + ^a(n; k,s)x n , (1.18) 

n— 1 

where a(n; k, s ) is the number of ways to write n as a sum of exactly s perfect k th powers. 

Exercise 1.1.9. Prove (1.18). 

Remark 1.1.10 (Important). So far, all we have done is to use generating functions to find an equivalent 
formulation for the original problem. We must find a good way to determine a(n: k, s). 

If we could show that given a k there exists an s such that for all n, a(n: k, s) f 0, then we would 
have proved every number is the sum of s perfect k th powers. The smallest such s, if it exists, is usually 
denoted by g(k). In 1770 Waring stated without proof that every natural number is the sum of at most 
nine positive perfect cubes, also the sum of at most 19 perfect fourth powers, and so on. It was already 
known that every number is a sum of at most four squares. It is usually assumed that Waring believed that 
for all k, g{k) exists. Hilbert [Hil] proved Waring’s conjecture in 1909, though his method yielded poor 
bounds for the true value of g(k). 

Exercise 1.1.11. Show that no number of the form Ak + 3 can be the sum of two squares. Show that no 
number of the form 4 a (8 k + 7) is the sum of three squares. This exercise shows that we cannot write all 
sufficiently large numbers as the sum of three squares. 

Exercise 1.1.12. Let n k = 2 k (|) fc — 1. How many perfect k th powers are needed to represent n k as a 
sum of k ,h powers? Conclude that g(k) > 2 k + (|) fc — 2. This gives g{2) > 4, g( 3) > 9, < 7 (4) > 19, .... 
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Exercise 1.1.13. Using density arguments, we can often prove certain problems have no solutions. Show 
there are not enough perfect squares to write any (large) number as the sum of two squares. Use this 
method to determine a lower bound for how many perfect k' h powers are needed for each k. 

Let us concentrate on g( 2) = 4. As we now know that infinitely many numbers cannot be the sum of 
three squares, we need to show that every natural number can be written as a sum of four squares. There 
are many proofs of this important fact, the first of which is due to Lagrange (though it is believed that Dio- 
phantus was familiar with the theorem). One proof uses geometric considerations based on Minkowski’s 
theorem (see [Ste]). We refer the reader to Chapter XX of [HW] for three interesting proofs of the theo¬ 
rem, as well as [Na]. We are particularly interested in the proof in §20.11 and §20.12 of [HW] which uses 
generating functions. We set 

OO 

9(x) = xm2 ' (1.19) 

m =—oo 

If r(n) is defined by 

OO 

0(x) A = 1 + r(n)x n , (1-20) 

n= 1 

then r(n) is equal to the number of representations of n as the sum of four squares: 

r(n) = m 2 , m 3 , mf) : rn l e Z, n — m\ + m 2 2 + m\ + rn\ }. (1.21) 

Here the m,s are integers, and different permutations of the r/+s are counted as distinct. One can show 
that 

OO 

0( x y = 1 + 8 c n x n (1.22) 

n= 1 

where 

c n = m. (1.23) 

m|n,4fm 

Thus, r(n) = 8 c(n). As c n > 0, this implies that every integer is the sum of four squares. 

Exercise 1.1.14. Give exercises sketching proof of claim. 

The above is a common feature of such proofs: we show the existence of at least one solution by 
showing there are many. We proved n is the sum of four squares by actually finding out how many 
different ways n is the sum of four squares. In our investigations of other problems, we will argue 
similarly. 
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1.1.3 Goldbach’s conjecture 

Previously we considered the question of determining the smallest number of perfect k th powers needed 
to represent all natural numbers as a sum of k th powers. One can consider the analogous question for 
other sets of numbers. Namely, given a set A, is there a number sa such that every natural number can be 
written as a sum of at most .s 4 elements of A1 A set of natural arithmetic interest is the set P of all prime 
numbers. Goldbach, in a letter to Euler (June 7, 1742), conjectured that every integer is the sum of three 
primes. Euler reformulated this conjecture to every even integer is the sum of two primes. 

Exercise 1.1.15. Prove that if every integer is the sum of at most three primes, then every even number 
must be the sum of at most two primes. Conversely, show if every even integer is the sum of at most two 
primes, every integer is the sum of at most three primes. 

To date, Goldbach’s conjecture has been verified for all even numbers up to 2 • 10 16 (see [01]). There 
are deep unconditional results in the direction of Goldbach’s conjecture: 

1. Shnirel’man proved sp < oo. The proof is based on an ingenious density argument (see [Na], 
Chapter 7). 

2. Estermann [Estl] proved that almost every even number is the sum of two primes. 

3. Vinogradov showed every large enough odd number is the sum of three primes. We discuss the 

proof of Vinogradov’s theorem later. Vinogradov proved his theorem in [Vinl, Vin2], where he 
reformulated the Circle Method from the language of complex analysis to that of Fourier series. 
[ChWa] has shown that sufficiently large may be taken to be e e ' . 

4. Chen proved every even number is the sum of a prime and a number that is at most the product of 
two primes. Chen’s theorem is based on a sieve argument (see [Na], Chapters 9 and 10). 

In the next section we describe the key ideas of the Circle Method. This will allow us to approximate 
quantities such as a(rv,k,s) (see (1.18)). We return to generating function approaches to Goldbach’s 
conjecture in §1.3. 


1.2 The Circle Method 

We explain the key features of the Circle Method. We reinterpret some of the problems discussed in §1.1 
in this new language. 
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1.2.1 Problems 

The Circle Method was devised to deal with additive problems of the following nature: 

Problem 1.2.1. Given some subset AcN and a positive integer s, what natural numbers can be written 
as a sum of s elements of A, and in how many ways? Explicitly, what is 

T" ■ ■ ■ T - a>s ■ £ A} n FA. (1.24) 

More generally, one has 

Problem 1.2.2. Fix a collection of subsets Ai ,..., A s C FA and study 

{cii + • • • + a, s : a,i G A ,} n FA. (1.25) 

We give several problems where the Circle Method is useful. We confine ourselves to two common 
choices for A. The first choice is P, the set of primes: P = (2,3, 5, 7,11,... }. We denote elements of P 
by p. The second choice is K, the set of k th powers of non-negative integers; K = (0,1, 2 fc , 3 fc , 4 fc ,... }. 
We denote elements of K by n k . 

1. Consider A = P and s = 2. Thus we are investigating 

{Pi + P 2 ■ Pi prime} IT FA. (1-26) 

This is Goldbach’s conjecture for even numbers. 

2. Again let A = P but now let s — 3. Thus we are investigating 

{pi +P 2 + P 3 - Pi prime} fl N. (1-27) 

Vinogradov’s theorem asserts that every large enough odd number is included in the intersection. 

3. Let A = K and fix a positive integer s. We are studying 

{n\ + --- + n k s : m e N}nN. (1.28) 

This is Waring’s problem. 

4. Let — P = (—2, —3, —5,... }. If we consider P — P, we have 

{pi~p 2 }AN. (1.29) 

This tells us which numbers are the differences between primes. A related question is to study how 
many pairs (pi,p 2 ) satisfy p\ — p 2 = n. If we take n = 2, pi and p 2 are called twin primes. 
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In the following paragraphs we sketch the main ideas of the Circle Method, first without worrying 
about convergence issues, then highlighting where the technicalities lie. In Chapter 2 we work through 
all but one of these technicalities for a specific problem; the remaining technicality for this problem has 
resisted analysis to this day. We have chosen to describe an open problem rather than a problem where all 
the difficulties can be handled for several reasons. The first is that to handle these technicalities for one 
of the standard problems would take us too far afield, and there are several excellent expositions for those 
desiring complete detail (see [Da2, EE, Na]). Further, there are numerous open problems where the Circle 
Method provides powerful heuristics that agree with experimental investigations; after working through 
the problem in Chapter 2 the reader will have no trouble deriving such estimates for additional problems. 

1.2.2 Setup 

Let us consider Problem 1.2.1. As before, we consider a generating function 

F a (x) = X>“- ( L3 °) 

a£A 


Next, we write 

OO 

Fa(x) s = y~^r(n;g, A)x n . (1.31) 

n =1 

Exercise 1.2.3. Prove r(n ; s, A) is the number of ways of writing n as a sum of s elements of A. 

An equivalent formulation of Problem 1.2.1 is the following: 

Problem 1.2.4. Determine r(n ; s, A). 

In order to extract individual coefficients from a power series we have the following standard fact from 
complex analysis: 

Proposition 1.2.5. 1. Let 7 be the unit circle oriented counter-clockwise. Then 


1 

27t i 



1 ifn = - 1 ; 
0 otherwise. 


(1.32) 


2. Let P(z ) = Yl'kLo a k zk be a power series with radius of convergence larger than one. Then 


1 

27T i 


p w 


—n—1 


dz 


CL n . 


(1.33) 
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See §?? for a sketch of the proof, or any book on complex analysis (for example, [Al, La5]). Conse¬ 
quently, ignoring convergence problems yields 


r(n-s,A) = ^ f F^zYz-^dz. 


Definition 1.2.6 (e(x)). We set 
Exercise 1.2.7. Let m, n G Z. Prove 


e(x = e 


2ixix 


e(nx)e(—mx)dx = 


1 if n — m; 
0 otherwise. 


(1.34) 


(1.35) 


(1.36) 


An alternative, but equivalent, formulation is to consider a different generating function for A: 

f A (x) = y^e(ax). (1.37) 


a£A 


Again, ignoring convergence problems, 

i 

/ a(x) s e(—nx)dx = r(n;s,A). (1.38) 

If we can evaluate the above integral, not only will we know which n can be written as the sum of s 
elements of A , but we will know in how many ways. 

Exercise 1.2.8. Using exercise 1.2.7, prove (1.38). 



1.2.3 Convergence Issues 

The additive problem considered in Problem 1.2.1 is interesting only if A is infinite; otherwise, we can 

just enumerate oq H-1- a s in a finite number of steps. If A is infinite, the defining sum for the generating 

function f A (x) need not converge, or may not have a large enough radius of convergence. For each N, 
define 

A n = {aeA:a<N} = A D (0,1,..., N}. (1.39) 

Note the A N s are an increasing sequence of subsets 

An c An- h, (1-40) 
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and 


(1.41) 


lim An = A. 

N—>00 

For each N, we consider the truncated generating function attached to /l y: 

In(x) = ^ e(ax). (1.42) 

flG-A jy 

As [n (x) is a finite sum, all the convergence issues vanish. A similar argument as before yields 

./WA)* = ^2 r N (n-,s,A)e(nx), (1.43) 

n<sN 


except now we have r y (n: s, A), which is the number of ways of writing n as the sum of s elements of A 
with each element at most N. If n < N, then r\/v(n; s, A) = r(n ; s, A), the number of ways of writing n 
as the sum of s elements of A; note /jv A) s is the generating function for the sum of s elements (at most 
N) of A 

For example, if A = P (the set of primes), N = 10 and s = 2, then A w = P 10 = (2, 3, 5, 7}. 
An easy calculation gives ri 0 (8; 2, P ) = r(8; 2, P ) = 2. However, ri 0 (14; 2, P) = 1 (from 7 + 7) but 
r(14; 2, P) = 3 (from 7 + 7, 3 + 11, and 11 + 3). 

We have shown the following, which is the key re-formulation of these additive problems: 


Lemma 1.2.9. If n < N then 

r{n ; s, A) 


r N (n-,s , 



fN(x) s e( 


nx)dx. 


(1.44) 


However, having an integral expression for r'A-(n: s, A ) is not enough; we must be able to evaluate 
the integral (either exactly, or at least bound it away from zero). Note /n(x) has Ay terms, each term 
of absolute value 1. In many problems, for most x G [0,1] the size of /WA) is about y/\A N \, while for 
special x G [0,1] one has /y(.x) is of size \A N \. The main contribution to the integral is expected to 
come from x where /WA) is l ar g+ an( i often this integration can be performed. If we can show that the 
contribution of the remaining x is smaller, we will have bounded ry (n: s, A) away from zero. 


1.2.4 Major and Minor arcs 

The difficultly is evaluating the integral in Lemma 1.2.9. Many successful applications of the Circle 
Method proceed in the following manner: 

1. Given a set A, we construct a generating function /n(x) for A N . As /WA) is a sum of complex 
exponentials of size 1, we expect there will often be significant cancellation. See the comments 
after Theorem ?? for other examples of similar cancellation in number theory. 
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2. Split [0,1] into two disjoint pieces, called the Major arcs M. and the Minor arcs m. Then 

r(m;s,A) = rjv(m;s,A) = / f^(x)e(—mx)dx+ / f s N {x)e{—mx)dx. (1-45) 

J Ai J m 

The construction of M. and m depend on N and the problem being studied. 

3. On the Major arcs M. we find a function which, up to lower order terms, agrees with f^(x') and is 
easily integrated. We then perform the integration, and are left with a contribution over the Major 
arcs which is bounded away from zero and is large. 

4. One shows that as N —> oo, the Minor arcs’ contribution is of lower order than the Major arcs’ 
contribution. This implies that for n large, r^(n: s. A) > 0, which proves that large n can be 
represented as a sum of s elements of A. 

The last is the most difficult step. It is often highly non-trivial to obtain the required cancellation over 
the Minor arcs. For the problems mentioned, we are able to obtain the needed cancellation for A = P 
and s = 3 (every large odd number is the sum of three primes), but not A = P and s = 2; we give some 
heuristics in §1.3.7 as to why s = 2 is so much harder than s = 3. For A = K (the set of k th powers 
of integers), we can obtain the desired cancellation for s = $(k) sufficiently large. Hardy and Littlewood 
proved we may take s(k ) = 2 k + 1. Wooley and others have improved this result; however, in general we 
expect the result to hold for smaller s than the best results to date. 


1.2.5 Historical Remark 


We briefly comment on the nomenclature: we have been talking about the Circle Method and arcs, yet 
there are no circles anywhere in sight! Let us consider an example. Recall from Proposition 1.1.2 that the 
generating function for the partition problem is 


F(x) 


1 

(1 — x)(l — a: 2 )(l — x 3 ) ■ ■ ■ 


1 + P(n)x n . 

n=l 


(1.46) 


By (1.34), and ignoring convergence issues, we need to consider 


P(n) 


1 

2ni 


F{z)z 


—n— 1 


dz. 


(1.47) 


The integrand is not defined at any point of the form e(|). The idea is to consider a small arc around each 
point e(^). This is where F(z) is large. At least intuitively one expects that the integral of F(z) along 
these arcs should be the major part of the integral. Thus, we break the unit circle into two disjoint pieces, 
the Major arcs (where we expect the generating function to be large), and the Minor arcs (where we expect 
the function to be small). While many problems proceed through generating functions that are sums of 
exponentials, as well as integrating over [0,1] instead of a cirlce, we keep the original terminology. 
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1.2.6 Needed Number Theory Results 

In our applications of the Circle Method, we need several results concerning prime numbers. These will 
be used to analyze the size of the generating function on the Major arcs. As we have seen in §?? and 
§??, it is often easier to weight primes by logp in sums, and then remove these weights through partial 
summation. We use the following statements freely (see, for example, [Da2] for proofs). We constantly 
use partial summation; the reader is advised to review the material in §??. 

Theorem 1.2.10 (Prime Number Theorem). Let 7r(x) denote the number of primes at most x. Then 
there is a constant c < 1 such that 

5>gp = x + O (xexp(—cy/loga;)) . (1-48) 

p<x 


Equivalently, by partial summation we have 


7t(x) = ^^1 = Li(x) + O ^xexp(—-y/log 

p<x 


X, 


(1.49) 


where Li(x ) is the logarithmic integral, which for any fixed positive integer k has the Taylor expansion 

rx dt 


Li(x) = 


x 


1 2 logt logic log X 


l\x (k — l)!x ^ 

+ ••• + —- r 1 — + 0 


log K X 


X 


log fc+1 X 


(1.50) 


The above is the original version of the Prime Number Theorem. The error term has been strengthened 
by Korobov and Vinogradov to O (x exp (— c e v^log x) ) for any 0 <!• All we will need is 


x 


7T X = 


logx 


+ o 


X 


logx 


, ^^logp = x T o(x). 

p<x 


(1.51) 


Exercise 1.2.11. Using partial summation, deduce a good estimate for n(x) from (1.48). 
Exercise 1.2.12. Prove (1.50). 

Theorem 1.2.13 (Siegel-Walfisz). Let C, B > 0 and let a and q be relatively prime. Then 

i> gp = W) +0 i^k) 

V=a(q ) 


for q < log 5 x, and the constant above does not depend on x, q or a (i.e., it only depends on C and B). 
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One may interpret the Siegel-Walfisz Theorem as saying each residue class has, to first order, the same 
number of primes. Explicitly, for a fixed q there are f(q) numbers a relatively prime to q. Up to lower 
order terms each residue class has primes (see §??, §??). Note the main term is larger than the error 
term if we choose C sufficiently large. If we were to take q as large as x 5 for some S > 0, then the error 
term would exceed the main term; we want to apply this theorem when q is much smaller than x. The 
choice of the Major arcs is crucially influenced by the error term in the Siegel-Walfsiz Theorem. 


1.3 Goldbach’s conjecture revisited 

While we discuss the complications from estimating the integral over the Minor arcs below, we do not 
give details on actually bounding these integrals; the interested reader should consult [Da2, EE, Est2, Na]. 
It is our intention to only introduce the reader to the broad brush strokes of this elegant theory. 

Unfortunately, such an approach means that at the end of the day, we have not solved the original 
problem. We have chosen this approach for several reasons. While the technical details can be formidable, 
for many problems these details are beautifully presented in the above (and many other) sources. Further, 
there are many applications of the Circle Method where the needed estimates on the Minor arcs are 
not known, even assuming powerful conjectures such as the Generalized Riemann Hypothesis. In these 
cases, while it is often reasonable to assume that the contribution from the Major arcs is the main term, 
one cannot prove such statements. Thus, the techniques we develop are sufficient to allow the reader to 
predict the answer for a variety of open problems; these answers can often be tested numerically. 

For these reasons, we describe the ideas of the Circle Method for Goldbach’s problem: What are the 
Major and Minor arcs? Why do we obtain the necessary cancellation when s = 3 but not when s — 2? 
These examples are well known in the literature, and we content ourselves with a very brief introduction. 
In Chapter 2 we give a very thorough treatment of another Circle Method problem, Germain primes, 
which has applications to cryptography. The techniques for this problem suffice to estimate the Major arc 
contributions in many other problems (for example, how many twin primes are there less than x). 

We do not always explicitly compute the error terms below, often confining ourselves to writing the 
main term and remarking the correction terms are smaller. As an exercise, the reader is encouraged to 
keep track of these errors. 

1.3.1 Setup 

The Circle Method begins with a choice of a generating function specific to the problem. For analytical 
reasons (see remark ?? and §1.2.6), it is often convenient to analyze the weighted generating function 

Fn(x ) = log p ■ e(px) (1.53) 

p<N 
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instead of /jv(x), and pass to the unweighted function by partial summation. One could worked only with 
Jn(x) (see [Est2], Chapter 3); however, we prefer to use F N (x) as the weights are easily removed and 
simplify several formulas. Working analogously as before, to write m as a sum of s primes leads us to 

R,N )S ( j ni) = / Ff(x)e(—mx)dx, (1-54) 

Jo 

where now 

Rn, s (m) = 22 log Pi''' log p s - (1.55) 

P iH- \-p s =m 

Pi <N 

Exercise 1.3.1. Relate R^^m) an( J r/v(m; s, P). For details, see §2.7. 

Thus, if we can show R NiS {m ) is positive for N and m sufficiently large, then r(m;s,P) is also 
positive. 

1.3.2 Average Value of \F N (x)\ 2 

We use the little-Oh notation (see definition ??). Thus, N + o(N) means the answer is N plus lower order 
terms. Recall 

Fm(x) = 22 logp • e(px) (1.56) 

p<N 

Lemma 1.3.2. |Fjv(x)| < N + o(N). 

Proof. By the Prime Number Theorem, (1.48), we have 

22 l°gP ' e(px) < 22 logP = A + o(N) 

p<N p<AT 

Lemma 1.3.3. F N ( 0) = Fjv(1) = N + o(N), and F N (^) = —N + o(N). 

Proof. F n { 0) and F N ( 1) are immediate, as e(p • 1) = 1 for all p. For F N (^), note 

e(p.i) = e”* = h if?iSOdd 

V 2 ) 1+1 if pis even 

As there is only one even prime, 

Fn Q') = log 2 - 22 lo SP’ 

' ' 3 <p<N 

and the argument proceeds as before. 


F n (x) | = 


(1.57) 

□ 


(1.58) 


(1.59) 

□ 
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Exercise 1.3.4. How large are F N (F) and F^( f)? How big can o{N) be in Lemma 1.3.3? 

Thus F n (x) is occasionally as large as N; in §1.3.5 we describe the x where F N (x) is large. We can, 
however, show that the average square of F N (x) is significantly smaller: 

Lemma 1.3.5. The average value of \F N (x)\ 2 is N log N + o(N log N). 


Proof. The following trivial observation will be extremely useful in our arguments. Let g(x) be a complex¬ 
valued function, and let g(x) be its complex conjugate. Then \g{x)\ 2 = g(x)g(x). In our case, as 
F n (x) = F n (—x) we have 


\F]s[(x)\ 2 dx = / F]sr(x)FN(—x)dx 


/ ^ log p-e(px)J2 log q- 

p<N q<N 

EE logplogg / 

AT AT JO 


e(—qx)dx 


e ((p — q)x) dx. 


(1.60) 


p<N q<N 


By exercise 1.2.7, the integral is 1 if p = q and 0 otherwise. Therefore the only pairs (p. q) that contribute 
are when p — q, and we have 


i 

\F N {x)\ 2 dx = ]>>gV 

p<N 

Using partial summation (see exercise 1.3.9), we can show 

]Tlog 2 p = iVlogiV + o(N log N). 

p<N 



Thus 



F N (x)\ 2 dx 


NlogN + o(N log N). 


(1.61) 


(1.62) 


(1.63) 

□ 


Remark 1.3.6. The above argument is extremely common. The absolute value function is not easy to 
work with; however, qix)q(x) is very tractable (see also §11). In many problems, it is a lot easier to study 
f \ 9 (x)\ 2 than f \g(x)\ or J \g(x)\ 3 . 
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Remark 1.3.7 (Philosophy of Square-root Cancellation). The average value of\F N (x) | 2 is about N log N, 
significantly smaller than the maximum possible value ofN 2 . Thus, we have almost square-root cancella¬ 
tion on average. In general, if one adds a “random” set of N numbers of absolute value 1, the sum could 
be as large as N, but often is at most of size \/N. For more details and examples, see §?? and §??. 

Exercise 1.3.8. Investigate the size of e 27T, ' x ' 2 ' p for p prime. Hint: rewrite the sum as two sums by 
using the Legendre symbol (see §??). 

Exercise 1.3.9. Using the Prime Number Theorem and Partial Summation, prove 

5>g 2 p = iVlogiV + o(NlogN). (1.64) 

p<N 


1.3.3 Large Values of F N (x ) 

For a fixed B, let Q = log B N. Fix a q < 0 and an a < q with a and q relatively prime. We evaluate 
F n . While on average F N (x) is of size y/ N log N, for x near such | we shall see that F N (x ) is large. 


F n 



Y log p-e 

p<N 



(1.65) 


The summands on the right hand side depend weakly on p. Specifically, the exponential terms only depend 
on p mod q, which allows us to rewrite F N as a sum over congruence classes: 


Fn 



q 

EE log p ■ e 

T —1 p=r(q) 
p<N 

q 

EE log p ■ e 

T =1 P=r(q) 
p<N 





q / 

E f ar 

e \j) ^ 

T —1 x 7 p=r(q) 
p<N 


logp. 


( 1 . 66 ) 


We use the Siegel-Wall!sz Theorem to evaluate the sum over p = r mod q. We first remark that we may 
assume r and q are relatively prime (see exercise 1.3.10). Briefly, if p = r mod q, this means p = aq + r 
for some a 6 N. If r and q have a common factor, there can be at most one prime p (namely r) such 
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that p = r mod q, and this can easily be shown to give a negligible contribution. For any C > 0. by the 
Siegel-Walfisz Theorem 


J> gp ^ + 0 te)' 

p<N 


(1.67) 


As <f>(q ) is at most q which is at most log 5 N, we see that if we take C > B then the main term is 
significantly greater than the error term. Note the Siegel-Walfisz Theorem would be useless if q were 
large, say q ~ N s . Then the main term would be like N x ~ s , which would be smaller than the error term. 
Thus we find 



(r, 9 ) = l 


( 1 . 68 ) 


If the sum over r in (1.68) is not too small, then F N is “approximately” of size with an error of 

size ]q ,/ X n v - If C > 21 j. the main term is significantly larger than the error term, and Fn j is large. 

The Siegel-Walfisz Theorem is our main tools for evaluating the necessary prime sums, and it is useful 
only when the error term is less than the main term. Our investigations of the (potential) size of F N (x) 
lead us to the proper definitions for the Major and Minor arcs in §1.3.4. 


Exercise 1.3.10. Show the terms with r and q not relatively prime in (1.66) contribute lower order terms. 


1.3.4 Definition of the Major and Minor Arcs 

We split [0,1] into two disjoint parts, the Major and the Minor arcs. As |F/v(a;)| 2 is of size NlogN on 
average, there is significant cancellation in Fn(x ) most of the time. The Major arcs will be a union of 
very small intervals centered at rationals with small denominator relative to N. Near these rationals we 
can approximate F N (x) very well, and F N (x) will be large (of size N ). The Minor arcs will be the rest 
of [0,1], and here we expect F N (x) to be significantly smaller than N. Obtaining such cancellation in 
the series expansion is not easy - this is the hardest part of the problem. In many cases we are unable 
to prove the integral over the Minor arcs is smaller than the contribution from the Major arcs, though we 
often believe this is the case, and numerical investigations support such claims. 
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Major Arcs 


The choice of the Major arcs depend on the problem being investigated. In problems where the Siegel- 
Walfisz Theorem is used, the results from §1.3.3 suggest the following choice. Let B > 0, and let 
Q = log B N -C N. For each q e {1,2,... ,Q} and a G {1,2,... ,q} with a and q relatively prime, 
consider the set 


M 


a,q 



e [0.1] : 




(1.69) 


We also add in one interval centered at either 0 or 1, i.e., the interval (or wrapped-around interval) 


0, 



u 



(1.70) 


Exercise 1.3.11. Show that if N is large then the Major arcs A4 ajJ are disjoint for q < Q and a < q, a 
and q relatively prime. 

We define the Major arcs to be the union of the arcs A4 aj] : 


Q q 

M = U U Ma, q , (1.71) 

< 7=1 0=1 

(a,q )=i 


where (a, q) is the greatest common divisor of a and q. 

Remark 1.3.12. As the Major arcs depend on N and B, we should write fA a>q (N, B ) and M.(N, B); 
however, for notational convenience these subscripts are often suppressed. 

Exercise 1.3.13. Show \M.\ < As Q = log B N, this implies \M\ —> 0 as N —> oo. Thus in the limit 
most of [0,1] is contained in the Minor arcs; the choice of terminology reflects where F N (x) is large, and 
not which subset of [0,1] is larger. 

Note that the above choice for the Major arcs has two advantages. First, recall that we required the 
denominator q to be small relative to A r : q < Q = log /; N. Once a denominator is small for some N, we 
can apply the Siegel-Walfisz Theorem and we can evaluate F N (^) well (see §1.3.3). Second, each Major 

arc M. a , q has length ^ = 2lo ^ — ; as these intervals are small, we expect F N (x) « F N (^). It should be 
possible to estimate the integral over A4 a , q . Thus, for a fixed the size of the arc about it tends to zero 
as N tends to infinity, but F^(x) becomes better and better understood in a smaller windows about |. 

Exercise 1.3.14. For large N, find a good asymptotic formula for \AA\. 

Exercise 1.3.15. For a fixed B, how large must N be for the Major arcs to be disjoint? 
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Minor Arcs 

The Minor arcs, nr, are whatever is not in the Major arcs. Thus, 

nr = [0,1]-M (1.72) 

Clearly, as N —► oo almost all of [0,1] is in the Minor arcs. The hope is that by staying away from 
rationals with small denominator, we will be able to obtain significant cancellation in F N (x). 


1.3.5 The Major Arcs and the Singular Series 

We are trying to write m as a sum of s primes. Let us consider the case m — N and s = 3. We have 
shown the (weighted) answer is given by 


F N {x) 3 e(—Nx)dx 


Y log pi logp 2 logp 3 ; 

Pl.P2'P3< JV 

P1+P2+P3= n 


(1.73) 


the weights can easily be removed by partial summation. We merely sketch what happens now; we handle 
a Major arc calculation in full detail in Chapter 2. 

First one shows that for x G M a , q , F N (x) is very close to F N While one could calculate the 
Taylor Series expansion (see §??), in practice it is technically easier to find a function which is non¬ 
constant and agrees with Fn(x) at x = L As the Major arcs are disjoint for large N, 



F N (x) 3 e(—Nx)dx 



F N (x) 3 e(—Nx)dx. 


(1.74) 


For heuristic purposes, we approximate F N (x) 3 e(—Nx) by F N ^ J e y—N^J . After reading Chapter 2 
the reader is encouraged to do these calculations correctly. Therefore 

I^F N (x) 3 e(-Nx)dx « I^F n ^ e dx = F N ^ e (1.75) 

In (1.68) we used the Siegel-Walfisz Theorem to evaluate F N Again, for heuristic purposes we 
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suppress the lower order error terms, and find that the contribution from the Major arcs is 


^2 logpilogp 2 logp 3 

P1,P2’P3^ n 

P1+P2+P3= n 



iV^1.76) 


To complete the proof, we need to show that what multiplies N 2 is positive and not too small. If N 2 
were multiplied by for example, the main term from the Major arcs would be of size jj, which could 
easily be cancelled by the contribution from the Minor arcs. An elementary analysis often bounds the 
factor away from 0 and infinity. 

Note that, up to factors of log A r (which are important!), the contribution from the Major arcs is of 
size N 2 . A more careful analysis, where we do not just replace fN(x) 3 e(—Nx) with /Ar(^) 3 e N^j on 
A4 a q , would show that the Major arcs contribute 

N' 2 

6(N)— + o(iV 2 ), (1.77) 


with 


e(iV) = £ 

q =1 



M 3 rr 


(a,q) = l 



(1.78) 


&(N) is called the Singular Series; in all Circle Method investigations, the contribution from the Major 
arcs is given by such a series. The singular series for Germain primes will be discussed in detail in Chapter 
2; for complete details on the singular series for sums of three primes, the interested reader should see 
[EE, Na]. If we set 


then one can show 


c r m 


S(N) 



if p\N 
otherwise 

(1.79) 

cgATA 

0(P) 3 ) ’ 

(1.80) 


The product expansion is a much more useful expression for the factor multiplying N 2 than the series 
expansion. If N is odd, there exist constants c± and c 2 such that 


0 < ci < &(N) < c 2 < 00 . 


(1.81) 
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This allows us to conclude the Major arcs’ contribution is of order N 2 . 

We do not go into great detail concerning the arithmetic properties of &(N), and content ourselves 
with an important observation. If &(N) > c± for all N, then the main term will be greater than the error 
term for N sufficiently large. Can &(N) ever vanish? 

Consider N even. Then c 2 (N) = 1, </>(2) = 1, and the factor in &(N) corresponding to p = 2 
vanishes! Thus, for even N, the main term from the Circle Method is zero. In hindsight, this is not 
surprising. Assume an even N > 6 can be written as the sum of three primes. Exactly one of the primes 
must be even (if all or exactly one were odd, then N would be odd; if all were even, N would be 6). 
Therefore, if the Circle Method tells us that we can write an even N as the sum of three primes, we could 
immediately conclude that iV — 2 is the sum of two primes. 

The Singular Series “knows” about the difficulty of Goldbach. For many Circle Method problems, 
one is able to write the main term from the Major arcs (up to computable constants and factors of log N ) 
as &(N)N a , with &(N) a product over primes. The factors at each prime often encode information about 
obstructions to solving the original problem. For more on obstructions, see §??. 

Exercise 1.3.16. For N odd, show there exist positive constants c i, c 2 (independent of N) such that 0 < 
ci < &(N) < c 2 < oo. 

Exercise 1.3.17. In the spirit of exercise 1.1.13, we sketch a heuristic for the expected average value of 
the number of ways of writing n as a sum of k primes. Consider n G [. N , 2A r ] for N large. Count the 
number ofk-tuples of primes with Pi + ■ ■ ■ + Pk £ [N, 2 N], As there are approximately N even numbers 
in the interval, deduce the average number of representations for such n. What if we instead considered 
short interx’als, such as n G [. N , A rl f j'or some § > 0? 

Exercise 1.3.18. Prove (1.78) implies the product representation in (1.80). Hint: many of the sums of 
arithmetic functions arise in the Germain prime investigations; see §2.6. 

Remark 1.3.19 (Goldbach). If instead we investigate writing even numbers as the sum of two primes, 
we would integrate F N (x) 2 and obtain a new singular series, say &(N). The Major arcs would then 
contribute 6(N)N. 
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1.3.6 Contribution from the Minor Arcs 

We bound the contribution from the Minor arcs to R NtS (N): 


F N (x) 3 e(—Nx)dx < / \F N (x)\ 3 dx 


< (max |.F/v(x)|') / \F N (x)\ 2 dx 

\ xem J J m 

< (max I -Fat (a) I) / F N (x)F N (-x)dx 

\x£m J J Q 

< ("max |Fjv(x)| N ) N\ogN. 

\ x£m / 


(1.82) 


As the Minor arcs are most of the unit interval, replacing f m with Jj' does not introduce much of an 
over-estimation. In order for the Circle Method to succeed, we need a non-trivial, good bound for 


max |F/v(x)| 


(1.83) 


This is where most of the difficulty arises, showing that there is significant cancellation in F\j(x) if we 
stay away from rationals with small denominator. We need an estimate such as 


or even 


max lEWaAl < —r—-, 

xem 1 iVV n ~ \ 0 g 1+€ N 


max |F A r(x)| o 

xGm 


(1.84) 


(1.85) 


xSm ylog N J 

Relative to the average size of \F N (x)\' 2 , which is A r log A r , this is significantly larger. Unfortunately, 
as we have inserted absolute values, it is not enough to bound \F N (x) on average - we need to obtain a 
good bound uniformly in x. We know such a bound cannot be true for all x £ [0,1], because Fn(x) is 
large on the Major arcs! The hope is that if x is not near a rational with small denominator, we will obtain 
moderate cancellation. While this is reasonable to expect, it is not easy to prove; the interested reader 
should see [EE, Na]. Following Vinogradov [Vinl, Vin2] one shows 


max |E)v(x)| < D , 
xem log N 


( 1 . 86 ) 


which allows one to deduce any large odd number is the sum of three primes. While (1.63) gives us 
significantly better cancellation on average, telling us that \F n (x )\ 2 is usually of size N, bounds such as 
(1.86) are the best we can do if we require the bound to hold for all x £ m. 
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Exercise 1.3.20. Using the definition of the Minor arcs, bound 



F N (x) 2 \dx 



F N {x) 2 \dx 


(1.87) 


Show, therefore, that there is little harm in extending the integral of \F N (x)' 2 \ to all of[ 0,1]. In general, 
there is very little loss of information in integrating \ F N (x) 1 . 


1.3.7 Why Goldbach’s Conjecture is Hard 


We give some arguments which indicate the difficulty of applying the Circle Method to Goldbach’s con¬ 
jecture. To investigate R NtS (N), the number of ways of writing N as the sum of s primes, we considered 
the generating function 

F n (x) = y log p ■ e (jpx) , (1.88) 

p<N 


which led to 


R n , s (N) = / F N (x) s e(—Nx)dx. 


(1.89) 


Remember that the average size of \F N (x)\ 2 is NlogN. 

We have seen that, up to logarithms, the contribution from the Major arcs is of size N 2 for s = 3. 
Similar arguments show that the Major arcs contribute on the order of N s_1 for sums of s primes. We 
now investigate why the Circle Method works for s — 3 but fail for s = 2. 

When s — 3, we can bound the Minor arcs contribution by 


F N (x) 3 e(—Nx)dx 


< max|F/v(x)| / \Fn(x)\ 2 cI 

< 




max | F n (x) | ■ N log N. 


(1.90) 


As the Major arcs contribute &(N)N 2 , one needs only a small savings on the Minor arcs; Vinogradov’s 
bound 

N 

max Fjv(a;) -C -—. (1.91) 

log N 

suffices. What goes wrong when s — 2? The Major arcs’ contribution is now expected to be of size N. 
How should we estimate the contribution from the Minor arcs? We have F]y(x) 2 e(—Nx). The simplest 
estimate to try is to just insert absolute values, which gives 


F N (x) 2 e(—Nx)dx 


< 


\F N (x)\ 2 dx = N. 


(1.92) 
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Note, unfortunately, that this is the same size as the expected contribution from the Major arcs! 

We could try pulling a max l£m Fv(x)| outside the integral, and hope to get a good saving (pulling 
out \Fn(x) ( l> clearly cannot work as the maximum of this is at least N log N). The problem is this leaves 
us with f m \F N (x)\dx. As F N (x) on average is of size y/N log N (this is not quite right: we have only 
shown \F n (x )\ 2 on average is A r log N; however, let us ignore this complication and see what bound we 
obtain), replacing F v (x) in the integral with its average value leads us to 


F N (x) 2 e(—Nx)dx < max |F/v(:r)| • \J N log N. 

m * em 

As the Major arcs’ contribution is of size N, we would need 


max |Ftv(x)| 

trGm 



(1.93) 


(1.94) 


There is no chance of such cancellation; this is better than square-root cancellation, and contradicts the 
average value of \F N (x)\ 2 from (1.63). 

Another approach is to use the Cauchy-Schwarz Inequality (see Lemma ??): 


Thus 


\f{x)g{x)\dx < 


\f(x)\ 2 dx 


\g(x)\ 2 dx 


(1.95) 


F/v (x) 2 e (— mx) dx 




r 1 

< 

max F/vhr) 

/ 1 



Jo 


F N (x)\dx 


< 

< 


max |Fjv(x)| 



max|Fjv(x)| • (AHogiV)^ • 1. 



(1.96) 


Unfortunately, this is the same bound as (1.93), which was too large. 


Remark 1.3.21. Even though it failed, it was a good idea to use the Cauchy-Schwartz inequality. The 
reason is we are integrating over a finite interval; thus f Q l 2 dx is harmless; if the size of the interval 
depended on N (or was all ofWL), applying Cauchy-Schwartz might be a mistake. 


While the above sketch shows the Circle Method is not, at present, powerful enough to handle the 
Minor arcs’ contribution, all is not lost. The quantity we need to bound is 


F N (x) 2 e(—mx)dx 


(1.97) 
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However, we have instead been studying 



F N (x)\ 2 dx 


(1.98) 


and 


max |F/v(x)| 



F N (x)\dx. 


(1.99) 


We are ignoring the probable oscillation and cancellation in the integral f m F/^(x) 2 e( — mx)dx. It is this 
expected cancellation that would lead to the Minor arcs contributing significantly less than the Major arcs. 

However, showing there is cancellation in the above integral is very difficult. It is a lot easier to work 
with absolute values. Further, just because we cannot prove that the Minor Arc contribution is small, does 
not mean the Circle Method is not useful. Numerical simulations confirm, for many problems, that the 
Minor arcs do not contribute for many N. For example, for N < 10 9 , the observed values are in excellent 
agreement with the Major arc predictions (see [Ci, Sch, Weir]).ADD DATA?? 
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Chapter 2 

Circle Method: Heuristics for Germain Primes 


We apply the Circle Method to investigate Germain primes. As current techniques are unable to ade¬ 
quately bound the Minor arc contributions, we concentrate on the Major arcs, where we perform the cal¬ 
culations in great detail The methods of this chapter immediately generalize to other standard problems, 
such as investigating twin primes or prime tuples. 

We have chosen to describe the Circle Method for Germain primes as this problem highlights many 
of the complications that arise in applications. Unlike the previous investigations of writing N as a sum 
of s primes, our generating function F N (x) is the product of two different generating functions. To 
approximate F N (x) on the Major arc M a , q , we could try to Taylor expand; however, the derivative is not 
easy to analyze or integrate. Instead we construct a new function which is easy to integrate on [— 
has most of its mass concentrated near |, and is a good approximation to F N (x) on M a , q - To show the 
last claim requires multiple applications of partial summation. For numerical investigations of the Minor 
arcs, as well as spacing properties of Germain primes, see [Weir]. 

In §2.1 and §2.2 we define Germain primes, the generating function Fn(x), and the Major and Minor 
arcs. In §2.3 we estimate F N (x) and find an easily integrable function u(x) which should be close to 
F n (x) on the Major arcs. We prove u(x) is a good approximation to F N (x) in §2.4; this is a technical 
section and can easily be skimmed on a first reading. We then determine the contribution from the Major 
arcs by performing the integration in §2.5 and then analyzing the singular series in §2.6. Finally, in §2.7 
we remove the logp weights and then conclude with some exercises and open problems. 


2.1 Germain Primes 

Consider an odd prime p. Clearly p — 1 cannot be prime, as it is even; however, l Fr could be prime, and 
sometimes is as p = 5, 7 and 11 show. 

Definition 2.1.1 (Germain Prime). A prime p is a Germain prime (or p and Fy- are a Germain prime 
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pair) if both p and ^-A are prime. An alternate definition is to have p and 2p -]- 1 both prime. 

Germain primes have many wonderful properties. Around 1825, Sophie Germain proved that if p is 
a Germain prime, then the first case of Fermat’s Last Theorem, which states the only integer solutions of 
x p + y p = z p have p\xyz, is true for exponent p. For more on Fermat’s Last Theorem, see §??. As another 
application, recent advances in cryptography are known to run faster if there are many Germain primes 
(see [AgKaSa]). 

Germain primes are just one example of the following type of problem: Given relatively prime positive 
integers a and b, for p < N how often are p and op + b prime? Or, more generally, how often are 
p, a \p + 6 1 , a k p + b k prime? One well known example is the famous Twin Prime Conjecture, which 
states that there are infinitely many primes p such that p + 2 is also prime. It is not known if this is true. 
Unlike the sum of the reciprocals of the primes, which diverges, Brun has shown that the sum of the 
reciprocal of the twin primes converges (see [Na]). Therefore, if there are infinitely many twin primes, 
there are in some sense fewer twin primes than primes. Explicitly, Brun proved that there exists an A), 
such that for all N > N 0 , the number of twin primes less than N is at most . This should be 
compared to the number of primes less than N, which is of size Using the Circle Method, Hardy 
and Littlewood were led to conjectures on the number of such primes, and their Major arc calculations 
agree beautifully with numerical investigations. 

We have chosen to go through the calculation of the number of Germain primes less than N rather 
than twin primes as these other problems are well documented in literature ([Da2, EE, Est2, Na]). Note 
the Germain problem is slightly different from the original formulation of the Circle Method. Here, we 
are investigating how often p\ — 2 p 2 = 1, with p 2 < p\ < N. Let 

Ai = {p : p prime} 

A 2 = {—2p : p prime}. (2.1) 

To construct generating functions that converge, we consider the truncated sets 

Ain = {p : P prime, p < N} 

A 2N = {— 2p : p prime, p < N}. (2.2) 

We are interested in 

{(fli; d 2 ) : cii + a 2 — 1, ai G A^n}. (2.3) 

In the original applications of the Circle Method, we were just interested in whether or not a number m 
was in An + • • • + An. To show m could be written as the sum of elements a* G An, we counted the 
number of ways to write it as such a sum, and showed it was positive. 

For Germain primes and related problems, we are no longer interested in determining all numbers that 
can be written as the sum + a 2 . We only want to find pairs with ai + a 2 = 1. The common feature 
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with our previous investigations is showing how many ways certain numbers can be written as the sum of 
elements in n- Such knowledge gives estimates for the number of Germain primes at most N. 

For any N > 5 we know 1 G Ain + A 2 n as 5 is a Germain prime. Note the number of ways of writing 
1 as di + a 2 with a* G A iN is the number of Germain primes at most N; similar to before, we need to 
compute for this problem r(l; A 1N , A 2N ), with the obvious notation. 

Exercise 2.1.2. Looking at tables of primes less than 100, do you think there will be more Germain primes 
or twin primes in the limit? What if you study primes up to 10 4 ? Up to 10 8 ? What percent of primes less 
than 100 GO 4 ,10 8 ) are Germain primes? Twin primes? How many primes less than N (for N large) do 
you expect to be Germain primes? Twin primes? 

Exercise 2.1.3. By the prime number theorem, for primes near x the average spacing between primes is 
log x. One can interpret this as the probability a number near x is prime is We flip a biased coin with 

probability ^4- of being a prime, 1 — of being composite; this is called the Cramer model. Using 
such a model predict how many Germain primes and twin primes are less than N. 

Remark 2.1.4 (Remark on the previous exercise). The Cramer model of the previous exercise cannot 
be correct - knowledge that p is prime gives some information about the potential primality of nearby 
numbers. One needs to correct the model to account for congruence information. See §?? and [Rubl ]. 


2.2 Preliminaries 


We use the Circle Method to calculate the contribution from the Major arcs for the Germain problem, 
namely, how many primes p < N there are such that is also prime. As pointed out earlier, for this 
problem the Minor Arc calculations cannot be determined with sufficient accuracy to be shown to be 
smaller than the Major arc contributions. We will, however, do the Major arc calculations in complete 
detail. Let 


e(x) 

A(n) 


^2nix 

j log p if n — p is prime; 
I 0 otherwise. 


(2.4) 


We constantly use the integral version of partial summation (§??) and the Siegel-Walfisz Theorem (The¬ 
orem 1.2.13). We have introduced the arithmetic function A (n) for notational convenience. In applying 
partial summation, we will have sums over integers, but our generating function is defined as a sum over 
primes; A (n) is a convenient notation which allows us to write the sum over primes as a sum over integers. 
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2.2.1 Germain Integral 


Define 


Fin(x) = 22 log pi • e(p l x) 

Pi<N 

F 2N {x) = 22 lo g^ 2 ' ei-Zpix) 

P2<N 

F n (x) = 22 22 logPllogP2 ' e ^ Pl ~ 2p ^ x "> = F 1N (x)F 2N (x). (2.5) 

pi<N p2<N 

F n (x) is the generating function for the Germain primes. As F N (x) is periodic with period 1, we can 
integrate either over [0,1] or [—|, |]. We choose the latter because the main contribution to the integral is 
from x near 0, although both choices obviously yield the same result. Letting r(l; Ain, A 2 n) denote the 
weighted number of Germain primes, we have 


r(l]A 1N ,A 2N ) = J i F N (x)e(-x)dx 

1 

E E logpilogp 2 f e((pi — 2p 2 — l)x)dx. 


pi<N P2<N 


By exercise 1.2.7 


e ((pi — 2p 2 — l)x) dx = 


1 if pi — 2p 2 — 1 = 0 
0 if pi — 2p 2 — 1^0 


( 2 . 6 ) 


(2.7) 


In (2.6) we have a contribution of logpi logp 2 if pi and p 2 = are both prime and 0 otherwise. Thus 


r(l; Ai N , A 2N ) = / F N (x)e(-x)dx = 22 logpilogp 2 . 


( 2 . 8 ) 


P1<N 
_ Pi -1 


The above is a weighted counting of Germain primes. We have introduced these weights to facilitate 
applying the Siegel-Walfisz formula; it is easy to pass from bounds for r(l; Ain, A 2 n ) to bounds for the 
number of Germain primes (see §2.7). 


Remark 2.2.1. Using the X-function from (2.4), we can rewrite the generating function as a sum over 
pairs of integers instead of pairs of primes: 


N N 

= E E A(mi)A(m 2 ) • e((mi — 2 mfjx). (2.9) 

m\ = l 777.2 — 1 
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Of course, the two functions are the same; sometimes it is more convenient to use one notation over the 
other. When we apply partial summation, it is convenient if our terms are defined for all integers, and not 
just at primes. 

Exercise 2.2.2. Determine (or at least bound) the average values of F\ m(x), F 2 n(x) and F^j(x). Hint: 
for Fn(x), use the Cauchy-Schwartz inequality. 


2.2.2 The Major and Minor Arcs 

Let B, D be positive integers with D > 2 B. Set Q = log D N. Define the Major arc M a , q for each pair 
(a, q ) with a and q relatively prime and 1 < q < log /; by 


if ^ f \ and 


M 


a,q 





■Mi £ 




Q 1 
N’ 2 


( 2 . 10 ) 


( 2 . 11 ) 


Remember, as our generating function is periodic with period 1, we can work on either [0,1] or [— \]. As 
the Major arcs depend on N and D, we should write A4 a , q (N, D ) and A4(N, I) ); however, for notational 
convenience these subscripts are often suppressed. Note we are giving ourselves a little extra flexibility 
by having q < log B N and each M a , q of size log N — . We see in §2.5 why we need to have D > 2 B. 

By definition, the Minor arcs m are whatever is not in the Major arcs. Thus the Major arcs are the 
subset of [—|, |] near rationals with small denominators, and the Minor arcs are what is left. Here near 
and small are relative to N. Then 


r(l; Ai N , A 2 n) = / F N (x)e{—x)dx = / F N (x)e(—x)dx + / F N (x)e(—x)dx. (2.12) 

We will calculate the contribution to r(l; Ai N , A 2N ) from the Major arcs, and then in §2.7 we remove the 
log^ weights. 

We chose the above definition for the Major arcs because our main tool for evaluating Fjy(x) is the 
Siegel-Walfisz formula (Theorem 1.2.13), which states that given any B,C > 0, if q < \og B N and 
(r, q) = 1 then 


l °SP 


p<N 

p= r (q) 


N 


+ o 


N 


log c N 


(2.13) 


For C very large, the error term leads to small, manageable errors on the Major arcs. For a more detailed 
explanation of this choice for the Major arcs, see §1.3.3 and §1.3.4. 
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We show the Major arcs contribute, up to lower order terms, T 2 N, where T 2 is a constant independent 
of N. By choosing B , C and D sufficiently large we can ensure that the errors from the Major arc 
calculations are less than the main term from the Major arcs. Of course, we have absolutely no control 
over what happens on the Minor arcs. Similar to Chapter 1 (see §??), up to powers of logiV we have 
F n (x) on average is of size N, but is of size N 2 on the Major arcs. As there is a lot of oscillation in the 
generating function F N (x), for generic x G [—|, |] we expect a lot of cancellation in the size of F N (x). 
Unfortunately, we are unable to prove that this oscillation yields the Minor arcs contributing less than the 
Major arcs. 

We highlight the upcoming calculations. On the Major arcs A4 a , g , we find a function u of size N 2 
such that the error from u to F N on A4 aq is much smaller than N 2 , say N 2 divided by a large power of 
log N. When we integrate u over the Major arcs, we find the main term is of size N (because up to powers 
of log N the Major arcs are of size -4), and we succeed if we can show the errors in the approximations 
are much smaller than N, say N divided by a large power of log N. Numerical simulations for x up to 
10 9 and higher support the conjecture that the Minor arcs do not contribute for the Germain problem. 
Explicitly, the observed number of Germain prime pairs in this range agrees with the prediction from the 
Major arcs (see [Weir]). We content ourselves with calculating the contribution from the Major arcs, 

/ F N (x)e{—x)dx. (2.14) 

JM 


2.3 Fn{x) and u(x) 

After determining F N on M a , q , we describe an easily integrable function which is close to F N on M a , q - 
We calculate F N j for q < log 5 N. Define the Ramanujan sum c q (a) by 

c g( a ) = e ( r ~) ' (2 ' 15) 

r =1 ^ ' 

(r,q)=l 

As usual, we evaluate Fn with the Siegel-Walfisz Theorem (Theorem 1.2.13). We restrict below to 
(ry, q) — 1 because if (r t , q) > 1, there is at most one prime j> l = r, mod q, and one prime will give a 
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negligible contribution as N —> oo. See also §1.3.5. We have 


F n I 1 = 


pi<N 

Q 


Y log pi • e (pi-) Y logp2 ' e [~ 2p2 


P2<N 


r i = l pi<JV 
Pl=ri(q ) 


E E log Pi ■ e I pi- E E logp 2 • e ( - 2 p 2 


r2=l P2^ N 

P2= r i(q) 


e r i- 


E 

ri = 1 

Q 

E'i'. 


E 

r 2 =l 


-2a 


e r 2 - 


lo SPi X] lo SP2 


P1 <iV 
pi = r i(i) 


P 2<N 

P2= r 2(q) 


=1 

( r i ji)—i 


E 

r 2 = l 
(p 2 .9) = 1 


-2a 


e r 2 - 


27 


+ 0 


9 7 l>(?) viog c ivyJ 


AT 


c q( a ) c q( 2a) 2 


27 + O 


iV 2 


<K<?) 2 ' Vlog c 

Exercise 2.3.1. S/iow that the contribution from one prime may safely be absorbed by the error term. 


(2.16) 


Let 

N N 

u{x ) = Y J e (( m i ~ 2 m 2 )x). 

7711 = 1 7712 — 1 

As m(0) = A r2 , it is natural to compare F N (x) on the Major arcs to 

c g (a)c q (-2a) ( a\ 

m 2 u v q) ’ 


(2.17) 


(2.18) 


as these two functions agree at x — A The function u(x) is a lot easier to analyze than F N (x). We show 

for x G M a , q that there is negligible error in replacing F N (x) with Cq(p Yi 2a) ti(x — 2 ). We then integrate 
over A4a q, and then sum over all Major arcs. We describe in great detail in Remark 2.5.9 why it is natural 
to consider u(x). 


2.4 Approximating F\ (x) on the Major arcs 


In this technical section we apply partial summation multiple times to show u is a good approximation to 
F N on the Major arcs A4 a q . Define 


C q (a) 


Cq(a)c q (—2a) 

0(g) 2 


(2.19) 
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We show 


Theorem 2.4.1. For a £ M. a , q , 

Fn£) = C q {a)u (a-^j+O . (2.20) 

For a £ M a , q , we write a as (3 + P £ [— . Remember Q = log D N and q < \og B N. Note 

F n (x) is approximately C q (a)N 2 for x near and from our definitions of F N , u and C q (a), (2.20) is 
immediate for a = F The reader interested in the main ideas of the Circle Method may skip to §2.5, 
where we integrate u(x) over the Major arcs. The rest of this section is devoted to rigorously showing that 

| F n {x) — C q (a)u(x — |)| is small. 

The calculation below is a straightforward application of partial summation. The difficulty is that we 
must apply partial summation twice. Each application yields two terms, a boundary term and an integral 
term. We will have four pieces to analyze. The problem is to estimate the difference 

S a>q (cx) = F N (a) - C q {a)u (a - ^ = F N (p + ^ - C q (a)u(P). (2.21) 

Recall that q < log B N and F N (^) = C q (a)u( 0) is of size ^2 • To prove Theorem 2.4.1 we must show 

that \S a , q (a)\ < Xog c N - 2 2 D N • As mentioned in Remark 2.2.1, it is easier to apply partial summation if 
we use the A-formulation of the generating function Fy because now both Fy and u will be sums over 

mi,m 2 < N. Thus 


Sa,q(oi) = y X(m l )X(m 2 )e((mi-2m 2 )P) - C q (a) y e((mi~2m 2 )P) 

m\ ,rri2<N mi 1 rri2<N 


E 

rai,77l2<-/V L 


A(mi)A(m 2 )e ( (mi - 2m 2 )- ) - C q (a) 


e ((mi - 2 m 2 )P) 


E 


.1712 <N L 


A(mi)A(m 2 )e ( (mi - 2m 2 )- ) - C q {a) 


E 

mi<N 

y y a m 2 (mi, N)b m 2 (mi, N) 

m\<N jri2<N 

y 5 a , 9 (a;m 1 )e(m 1 /i) j 

mi <N 


e(-2 m 2 P) 


e(mi/3) 


e(miP) 


( 2 . 22 ) 
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where 


dm 2 (m i,N) = \(m l )\(m 2 )e ^(mi - 2m 2 )°^j *- C q (a) 

b m2 (mi, N) = e(-2m 2 (3) 

S a , q (or,m i) = Y a m 2 {'mi,N)b m 2 (m 1 ,N). 

m,2<N 


(2.23) 


We have written S a , q (a) as above to illuminate the application of partial summation. We hold rn \ fixed 
and then use partial summation on the m 2 - sum. This generates two terms, a boundary and an integral term. 
We then apply partial summation to the mi-sum. The difficulty is not in evaluating the sums, but rather in 
the necessary careful book-keeping required. 

Recall the integral version of partial summation (Lemma ??) states 


N „ N 

'Y J a m b(m ) = A(N)b(N ) — / A(u)b'(u)du, 


(2.24) 


m= 1 


where b is a differentiable function and A(u) = Yl m <u ° m - We apply this to a m2 (mi,N) and b m2 {rn\. N). 
As b m2 = b(m 2 ) = e(—2 (3m 2 ) = e ~ 4ml3m2 , b'(m 2 ) = —4ni(3e(—2(3m 2 ). 

Applying the integral version of partial summation to the m 2 -sum gives 


7712 <-/V 


A(mi)A(m 2 )e ( (mi - 2m 2 )- ) - C q (a) 


S a , q (cr,mi) = Y 

iri 2 <N L 

Y ®m 2 (mi, N)b m2 (rrii, N) 

2 <N 

Y a m 2 (mi, N) 


e(-2 m 2 P) 


.rri2<N 


e(—2N/3) + 4iri/3 


eN 


' U=1 


Y a m 2 (mi, N) 

Lt712<71 


e(—u/3)du. 


(2.25) 


The first term is called the boundary term, the second the integral term. We substitute these into (2.22) 
and find 


S«,„(a) = X 

mi<N 


Y a m 2 (rrii, N) 
\_m,2<N 


e(-2N0) 


e(miP) 


+ E 

4:711/3 

/•TV 

Y (m^N) 

e{—uj3)du 

mi<N 

. 

lu= 1 

_m2<u 

. 


e(miP) 


= s a , q (ar, Boundary) + S a , q (or, Integral). 


(2.26) 
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The proof of Theorem 2.4.1 is completed by showing S a>q (a; Boundary) and S a/1 {a\ Integral) are 
small. This is done in Lemmas 2.4.3 and 2.4.8 by straightforward partial summation. 

Remark 2.4.2. The factor ofAm(3 in (2.26) is from differentiating bfmf). Remember a = | + (3 is in the 

Major arc M. a ,q — | — | + % ■ Thus, \/3\ < ^ = log jy jV . Even though the integral in (2.26) is over 

a range of length N, it is multiplied by /3, which is small. If (3 was not present, this term would yield a 
contribution greater than the expected main term. 


2.4.1 Boundary Term 

We first deal with the boundary term from the first partial summation on m 2 , S aJ1 (o: : Boundary). 


Lemma 2.4.3. 


Proof. Recall that 


S a , q (a] Boundary) = O 


\og c ~ D N 


(2.27) 


m\<N L \_m2<N 


S a ,q (a; Boundary) = ^ ^ a m2 (mi,N ) e(—2N/3) e(m\(3) 


2N/3) E E« m2 (rrii, N)\ e(mi/3). (2.28) 


mi<N \_m 2 <:N J 

As | (e(—2N/3) | = 1, we can ignore it in the bounds below. We again apply the integral version of partial 
summation with 

« mi = ^2 a rn 2 (, m ii N ) = A(mi)A(m 2 )e ( (mi - 2m 2 )-J - C q (a) 

m 2 <N L V ( 1 J 


m 2 <N 


b mi = e(miP). 


(2.29) 


We find 


e(2N/3)S a , q (a; Boundary) = E E (mi, N) e(N(3) 


rrii<N \_m2<:N 


pN 

2iri/3 EE- m 2 (‘mi, N)\ e(t/3)dt. (2.30) 

’J t =0 „— . ^ . s' AT 


Jt ~ 0 m 1 <t |_m 2 <iV J 

To prove Lemma 2.4.3, it suffices to bound the two terms in (2.30), which we do in Lemmas 2.4.4 and 
2.4.5. □ 
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Lemma 2.4.4. 


mi<N \_m 2 <:N 


E E«~ (mi, N) e(N/3) = O 


log c N 


(2.31) 


Proof. As \e(N/3)\ = 1, this factor is harmless, and the m i, m 2 -sums are bounded by the Siegel-Walfisz 
Theorem. 


E E a mMi,N) = E E H m i)K m 2)eUrn 1 -2m 2 )-j-C q (a) 

mi<N m,2<N m\<N m,2<N - ' ^ 

= V A(mi)e (mi-) E A(m 2 )e - C q (a)N 2 

rn\<N V ?/J UEiV V 9/J 

- + • h ( -y + o(^l -C q (a)N> 

l f{q) Viog c ivyJ L </>(<?) Viog c iv;J 9V 


= o 


log c N 


(2.32) 


as C q (a) = Cq{a) ^ 2 2a) and \c q (b)\ < f(q). 

Lemma 2.4.5. 


/ n r 

E E (mi,A”) e(tf3)dt = O 

mi<t _rri 2 <N 


\og c ~ D N 


(2.33) 


Proof. Note \j3\ < Q = log N — , and C q (a) = For t < x/A, we trivially bound the m 2 -sum 

by 2A. Thus these t contribute at most 


E 2 Ndt = \/3\N 2 < A log D A. 


(2.34) 


An identical application of Siegel-Walfisz as in the proof of Lemma 2.4.4 yields for t > Vn, 


E E ^777,2 (mi, A) 


mi<t 7772 


c q (a)t n f t 

f(q) V log 0 N 


= O 


log c A 


c q (—2a)N / A \1 ^.. Ar 

<t>(q) Vlog C Nj\ qV 

(2.35) 
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Therefore 


\P\ 


r- N 


lt=VN 


E E dm 2 (mi, JV) 


mi<t m 2 <N 


dt 


O 


Vlog c n) 


O 



(2.36) 


□ 

Remark 2.4.6. Note, of course, that the contribution is only negligible while \(3\ < We see a natural 
reason to take the Major arcs small in length. 


Remark 2.4.7. The above argument illustrate a very common technique. Namely, if t e [0, iV] and N 
is large, the interval [0, a fN] has negligible relative length. It is often useful to break the problem into 
two such regions, as different bounds are often available when t is large and small. For our problem, the 
Siegel-Walfisz formula requires that q < log B t; this condition fails ift is small compared to q. For small 
t, the bounds may not be as good; however, the length of such an interval is so small that weak bounds 
suffice. See also the example in §11. 


2.4.2 Integral Term 

We now deal with the integral term from the first partial summation on m 2 , S a>q (a:; Integral). 

Lemma 2.4.8. 

/ N 2 \ 

S a , q (a\ Integral) = O -— C _ 2D . (2.37) 

Vlog N J 

Proof. Recall 


S a>q (or, Integral) 


47t i/3 


mi<N 


r >N 


' u =1 


E a m2 (mi,7V) 


.m2<u 


e(—u/3)du 


e(mi/3) 


(2.38) 


where 


a m2 (miiN) = A(mi)A(m 2 )e ( (mi - 2m 2 )- ) - C q (a). 


(2.39) 


We apply the integral version of partial summation, with 


J m 1 


rN 

E a m 2 (m,N) 

' u= 1 

_m2<u 

-(m i/9). 


e(—u/3)du 


(2.40) 
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We find 


S a , q {or, Integral) = 47ri/3 a m2 (rrii, N)e(—u/3)du e(N/3 ) 


mi<AT u ^ rri2<u 


+ 87T/9 2 


£ £« TO2 (mi, N)e(—e(mit)dt. (2.41) 

mi <£ “—1 m 2 <u 


The factor of 87T/3 2 = — (47n/3) • (27ri/3) and comes from the derivative of e(rri\3). Arguing in a similar 
manner as in §2.4.1, in Lemmas 2.4.9 and 2.4.10 we show the two terms in (2.41) are small, which will 
complete the proof. □ 


Lemma 2.4.9. 


47t i[3 £ f £«» (mi, N)e(—e(N(3) = O f 

_m 1 <N '' u=1 m 2 <u J 


\og c ~ D N 


(2.42) 


Proof. Arguing along the lines of Lemma 2.4.5, one shows the contribution from u < y/N is bounded by 
AHog D N. For u > y/N we apply the Siegel-Walfisz formula as in Lemma 2.4.5, giving a contribution 
bounded by 


< \P\ 


rN f ~ c q (a)u 

i i=Vn V L 0(e) 

v uN , 


log 6 N 


c q (—2a)N 


log 6 N 


— C q (a)uN ) du 


Ju=Vn log N 

" 3 \P\ 

log C N' 

As \/3\ < log N — , the above is O 

Lemma 2.4.10. 


(2.43) 


\og c ~ u N /• 


/ N rN / 

£ / . £ ° N)e(—uP)du \ e{rn\t)dt — O ( 

=1 .mi<r" =1 m 2 <« . ' 


log G - 2Z7 iV 


(2.44) 


Proof. The proof proceeds along the same lines as the previous lemmas. Arguing as in Lemma 2.4.5, one 
shows that the contribution when t < y/N or u < \/N is O (^ log c N 2 D ■ We then apply the Siegel-Walfisz 
Theorem as before, and find the contribution when t,u > y/N is 


f N f N ut N 4 /3 2 

/ / —^—dudt -C —pa—. 

h=y/N Ju=Vn log 6 N log 6 N 


< 8/3 2 


(2.45) 
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As \(3\ < the above is O ( Iog c- 2 OJV ) • 

This completes the proof of Theorem 2.4.1. 


2.5 Integrals over the Major Arcs 

We first compute the integral of u(x)e(—x) over the Major arcs and then use Theorem 2.4.1 to deduce the 
corresponding integral of F N (x)e(—x). 

2.5.1 Integrals of u(x) 


By Theorem 2.4.1 we know for x G A4 a ,g that 


F N (x) ~ C q (a)u -C O ^ 


log 6 20 N 


(2.46) 


We now evaluate the integral of u(x — ^)e(-x) over A4„ q : by Theorem 2.4.1 we then obtain the integral 
of F N (x)e(—x) over M. a , q - Remember (see (2.17)) that 


u(x) = e ((mi — 2m 2 )x). 


(2.47) 


Theorem 2.5.1. 




f a \ / w ( a \ N N 

u [a • e(— a)aa = e —-f (J ^— 

V qj \ Qj 2 Viog d n 


(2.48) 


Theorem 2.5.1 will follow from a string of lemmas on various integrals of u. We first determine the 


integral of u over all of [— \ 4], and then show that the integral of u(x) is small if \x\ > (4. 


Lemma 2.5.2. 


Proof. 


u(x)e(—x)dx = — + 0(1). 


(2.49) 


u(x)e(—x)dx = / e ((mi — 2m 2 )x) ■ e(—x)dx 


' 2 m\<N n% 2 <N 


E E / e ((mi — 2 m 2 — 1 ).r) dx. 


m±<N m, 2 <N 2 


(2.50) 
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By exercise 1.2.7 the integral is 1 if mi — 2m 2 — 1 = 0 and 0 otherwise. For m 1; m 2 G (1,..., IV}, there 
are [^] = ^ + 0(1) solutions to mi — 2m 2 — 1 = 0, which completes the proof. □ 

Define 


-10 Q- 

, h = 

'Q 1 Q' 

2 + N' N _ 


_N’2 N_ 


The following bound is crucial in our investigations. 

Lemma 2.5.3. For x € J i or T^y ^Ifrrae (1, -2}. 
Exercise 2.5.4. Prove Lemma 2.5.3. 


(2.51) 


Lemma 2.5.5. 

Proof. We have 

/ u(x)e(—x)dx = 
Jh 


f ( N 

/ u(x)e(—x)dx = O -— 

Lehu i 2 Vlog D N 


/ e ((mi>- 2m 2 — l)x) dx 

^ mi ,m2<A( 

/ e(mix) e(—2m 2 x) • e(—x)d; 

L mi<iV mi-.<" A 


(2.52) 


a; 


m2<)V 


'A L 


e(x) — e((N + l)x) 

e(—2x) — e(—2(N + l)x)] 

1 — e(x) 

l- 

1 

Os 

T 

to 

1_ 


e(—x)dx (2.53) 


because these are geometric series. By Lemma 2.5.3 we have 

f , i f 22 J AT AT 

/ u{x)e(—x)dx -C / —me -C — = - s —> 

Jr, <5 log^ A/ - 


which completes the proof of Lemma 2.5.5. 


(2.54) 

□ 


Remark 2.5.6. It « because the error term in Lemma 2.5.5 is O ( 

log iv 


N 


that we must take D > 2B. 


Lemma 2.5.7. 


, l i Q 

2'N 


! x =i-Q 

2 N 


u(x)e(—x)dx = 0(log D A^). 


(2.55) 
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Proof. The argument is similar to the proof of Lemma 2.5.5. The difference is we use the geometric series 
formula only for the mi-sum, which is -C \. As x is near the mi-sum is 0(1). There are 

N terms in the m 2 -sum. As each term is at most 1, we may bound the m 2 -sum by N. Thus, the integrand 
is 0(N). We integrate over a region of length and see that the integral is (){()) = 0(log D N ) for N 
large. □ 

Note in the above proof we could not use the geometric series for both m-sums, as near x — \ the 
second sum is quite large. Fortunately, we still have significant cancellation in the first sum, and we are 
integrating over a small region. The situation is different in the following lemma. There, both m-sums are 
large. Not surprisingly, this is where most of the mass of u is concentrated. 

Lemma 2.5.8. 



u(x)e(—x)dx 



log D N J ' 


(2.56) 


Proof. This is immediate from Lemma 2.5.2 (which shows the integral of u over |] is y T- 0(1)) 
and Lemmas 2.5.5 and 2.5.7 (which show the integral of u over |x| > ^ is small). □ 

It is now trivial to prove Theorem 2.5.1. 


Proof of Theorem 2.5.1. We have 




e(— a)da 



-^ • e(— a)da 

qJ 

(-? -z 3 ) 



l N 

/ u((3)e(—(3)d(3 




log D N J ' 


(2.57) 


□ 

Note there are two factors in Theorem 2.5.1. The first, e , is an arithmetical factor which depends 
on which Major arc A4 ajQ we are in. The second factor is universal, and is the size of the contribution. 
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Remark 2.5.9. We remark once more on the utility of finding a function u(x) to approximate F N (x), as 
opposed to a Taylor series expansion. We found a function that is easy to integrate and by straightforward 
applications of partial summation is close to our generating function. Further, most of the mass ofu(x) is 
concentrated in a neighborhood of size about 0. Hence integrating u (or its translates) over a Major 
arc is approximately the same as integrating u over the entire interx’al. While there are a few points where 
we need to be careful in analyzing the behavior ofu, the slight complications are worth the effort because 
of how easy it is to work with u(x). For this problem, it was x = 0 giving the main contribution, and 
x = ±2 was a potential trouble point which turned out to give a small contribution. The reason we need 
to check x = ±2 is due to the definition of Germain primes, namely the 2 in F 2 n(x) = ff p2<N e(—2p 2 x). 
Because of this 2, when x is near \, F 2N (x ) is near N. 


2.5.2 Integrals of F]g(x) 

An immediate consequence of Theorem 2.5.1 is 

Theorem 2.5.10. 




F N (x)e(-x)dx = C q (a)e y + O ^ 


N 


log D N 


+ 0 


N 


log c ~ 3D N 


(2.58) 


Exercise 2.5.11. Prove Theorem 2.5.10. 

From Theorem 2.5.10 we immediately obtain the integral of Fjv(x)e(— x) over the Major arcs A4: 

Theorem 2.5.12. 


F N (x)e(—x)dx = 


Im 


\og B N q / \ AT 

£ £ c.(-)« (£) f + O 


( 7=1 0=1 

(o,q) = l 

N „ 

— &NXC + O 


N 


+ 


N 


q J 2 V log B N log' 


r C-3D-2B 


N 


N 


log D 2B N log' 


+ 


N 


C—3D—2B 


N 


where 


log^ N q 


&n - Cr ‘ 

( 7=1 0=1 

(o,q) = l 

is the truncated singular series for the Germain primes. 


a 

lale I — 


(2.59) 


(2.60) 
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Proof. As 


log s N q 

M = U U M **> (2 - 61) 

(7 = 1 0=1 

(o,?) = 1 

the number of Major arcs A4 a , q is bounded by log 2/; N. In summing over the Major arcs, the error terms 
in Theorem 2.5.10 are multiplied by at most log 2B N, and the claim now follows. □ 

We will show the main term in Theorem 2.5.12 is of size N; thus we need to take D > 2B and 
C > 3 D + 2 B. We study & N in §2.6, and remove the logweights in §2.7. 


2.6 Major Arcs and the Singular Series 

If we can show that there exists a constant Co > 0 (independent of N ) such that 

&N > C 0 , 


(2.62) 


then for D > 2 B and C > 3D + 2 B by Theorem 2.5.12 the contribution from the Major arcs is positive 


and of size 6 n y for N sufficiently large. Recall 

c q (a) = 

C q (a ) = 


a 

e | r— 

r=l x 7 

(r,q )=1 


E 

r= 1 
r,q) = 

Cq (^) Cq ( 2 CL) 


<t>(q¥ 


Substituting 


Pg 


°^ a ) e 


a= 1 
(a,q) = 1 


(2.63) 


(2.64) 


into the series expansion of in (2.60), we find that 


log B N 


^ Pq- 
q =1 


(2.65) 


The singular series for the Germain primes is 


6 = 

q =1 


( 2 . 66 ) 
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We show 6 is given by a multiplicative product and is positive in Theorem 2.6.18, and in Theorem 2.6.20 
we show 16 — 6n\ = O ^ log(1 _L)B N ) f° r an Y e > 0. This will complete our evaluation of the contribution 
from the Major arcs. 

Many of the arithmetical functions we investigate below were studied in Chapter ??. Recall a function 
/ is multiplicative if f(mn) = f(jn)f(n) for m,n relatively prime, and completely multiplicative if 
f(mn ) = f(m)f(n ); see definition ??. The reader should consult Chapter ?? as necessary. 


2.6.1 Properties of Arithmetic Functions 

We follow the presentation of [Na] (Chapter 8 and Appendix A), where many of the same functions arise 
from studying a related Circle Method problem. Below we determine simple formulas for the arithmetic 
functions we have encountered, which then allows us to prove our claims about & N and 6 (see §2.6.2). 


Lemma 2.6.1. If(q, <() = 1 then we can write the congruence classes relatively prime to qq' as rq' + r'q, 
with 1 < r < q, 1 < r' < q' and (r, q) = (r', q') = 1. 


Exercise 2.6.2. Prove Lemma 2.6.1. 
Lemma 2.6.3. c q (a) is multiplicative. 
Proof. Using Lemma 2.6.1 we have 


Cq((lj)Cqi (a) 


E 

r= 1 
(r,q )=1 


a 

e I r- 


E 

r’=l 
(r f ,q f )=l 


e \ r — 
<1 


E E 

r=l r '= 1 

(r,q) = l ( r /,q') = 1 


{rq' + r'q) a 

qq' 



(r,<3 <?')=! 


(2.67) 


□ 


We will soon determine c q (a ) for (a, q) = 1. We first state some needed results. 


Lemma 2.6.4. Show that 


M«) = E e ( r S 

r=l 


d ifd\a; 

0 otherwise. 
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( 2 . 68 ) 



Exercise 2.6.5. Prove the above lemma. 

Recall the Mobius function (see §??): 

-l) r if d is the product of r distinct primes; 
0 otherwise. 


fj,(d) = 


By Lemma ??, 

Y M°0 

d\(r,q) 

Lemma 2.6.6. If (a, q) — 1 then c q (a ) = jn(q). 
Proof. 


1 if (r, q) = 1; 
0 otherwise. 


(2.69) 


(2.70) 


c q {a) = 


£ 

r= 1 
(r,q) = 1 


e|r- | = 

q. 


7“ — 1 


I] e ( r “ ) V( d ), 


(2.71) 


d|(r,q) 


where we used (2.69) to expand the sum from (r, q) — 1 to all r mod q. Further 


c q {a) = 


Y I] 

d\q 


r =1 
d\r 


a 

e | r- 

q 


(2.72) 


This is because d\(r, q) implies cZ|r and d\q, which allows us to rewrite the conditions above. We change 
variables and replace r with t, as r ranges from 1 to q through values divisible by £, l ranges from 1 to f 
We will use Lemma 2.6.4 to evaluate this sum. Therefore 


Cq{a) = 


q/d 

Yb(d)Y e ( ^ 

d\q i=\ 

Y^d)h q/d (a) 

d\q 

Y^{j) hd (°) 

d\q 

EKs)" 

d\q 

d\a 


a 

q/d 


- Ensr 

<A (a,q) 

If (a, q) — 1 then the only term above is d = 1, which yields c q (a ) = n(q). 


(2.73) 

□ 
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Corollary 2.6.7. If q = p k , k > 2 and (a, g) = 1, c 9 (a) = 0. 

We have shown c qq >(a) = c g (a)c 9 /(a) if (g, q') = 1. Recall that the Euler 0-function, 0(g), is the 
number of numbers less than g which are relatively prime to g and is a multiplicative function (see §?? 
for more details). We now have 


Lemma 2.6.8. C q (a) is multiplicative in q. 
Proof. Assume (g, q') = 1. We have 


Cqqfa) = 


Cqq 1 (&)C qq ' ( 2o) 

0(gg') 2 

Cq ( 0 ) C q > (Q-)Cg( 2fl)Cg' ( 2(l) 

0(g) 2 0(g') 2 

Cq(u)Cg( 2a) Cg/(a)Cg'( 2a) 


0(g) 2 

= C q (a)C q fa). 


0(g') s 


(2.74) 

□ 


We now prove p q is multiplicative. We first prove a needed lemma. 

Lemma 2.6.9. //(gi,g 2 ) = 1, C gi (aig 2 ) = C 91 (ai). 

Proof As C f/] (aig 2 ) = 2a W 2 ) ^ W£ se£ - t su ffj Ces to s how c qi (aig 2 ) = c qi (ai) and c qi (—2aig 2 ) = 

c 9l (—2ai). As the proofs are similar, we only prove the first statement. From the definition of c q (a), (2.63), 
we have 


c qi (aig 2 ) 



( r ’1.9l) = 1 



( r ’.9l) = 1 


(2.75) 


because (gi, g 2 ) = 1 implies that as r\ goes through all residue classes that are relatively prime to gi, so 
too does r = rig 2 . □ 
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Lemma 2.6.10. p q is multiplicative. 

Proof. Recall 

Pq = C ^ a ) e (“^) ' ( 2 ' 7 6) 

( a,q) = l 

Assume (q 1 , q 2 ) = 1. By Lemma 2.6.1 we can write the congruence classes relatively prime to qpp as 
aiq 2 + a 2 qi, with 1 < ai < qi, 1 < a 2 < q 2 and (ai, qf) = (a 2 , 72) = 1. Then 


Pqiq2 


q 192 


^ 1 C 9i92( a ) e 


a =1 

(a,9192) = 1 
9192 


7i72 


C '9i( a ) C '9 2 («)e 


a=l 

(a,9192) = 1 

91 


7l72 


92 


E E C<2l( a l72 + a 2Ql)Cq 2 ( a iq2 + 0 2 5l)( 


a^ = l a 2 = 1 

Ol>9l) =1 ( tt 2.92) =1 


Qi72 + 
m2 


(2.77) 


A straightforward calculation shows C gi (aig 2 + a 2 gi) = C' q] {ci\q 2 ) and C (n {a\q 2 + a 2 qf) = C q2 (a 2 qi), 
which implies 


91 92 


^9192 


E E C qi (aiq 2 )C q2 (a 2 qi)t 


a\ = 1 a2 = 1 

(“l>9l)=l (“2.92) =1 


91 

C qi( a ^)t 

a^= 1 

.(“l>«l)=l 

a\ = 1 

.(“l>«l) = l 


P 91 ’ Pq2 ■> 


0 1 72 + <2'27l 

m2 


CL\ 

( i\ 


92 


X! C-' ?2 («27l)f 


7i 


a 2= 1 
(“2.92) = 1 


92 


a 2 

72 


£ Cq 2 (a 2 )e I - 


02=1 
(o 2 ’ 12 ) = 1 


a 2 

72 


(2.78) 


where we used Lemma 2.6.9 to replace C ' 91 («i 7 2 ) with C gi (ai), and similarly for C (? 2 (a 2 gi). Thus, p q is 
multiplicative. □ 

Exercise 2.6.11. Prove C qi (aiq 2 + a 2 qf) = C qi (a 1 q 2 ) and C q2 (aiq 2 + a 2 qf) = C q2 (a 2 qf). 
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We now determine p q . 

Lemma 2.6.12. Ifk > 2 and p is prime then p p k = 0. 

Proof. This follows immediately from C p k(a ) = 0 (see Corollary 2.6.7 and the definition of C q (a)). □ 

Lemma 2.6.13. Ifp > 2 is prime then p p = — 

Proof. 


Pp 



(2.79) 


Forp > 2, (a,p) = 1 implies (—2 a,p) = 1 as well. By Lemma (2.6.6), c p (a ) = c p (—2a) = pfp). As 
p{p) 2 = 1 and <p(p) = p — 1 we have 


Pp — 



(2.80) 

□ 


Lemma 2.6.14. p 2 = 1. 

Proof. 


P 2 = ^ C 2 (a)e 

a=l 

(a,2)=l 


= C 2 (l)e(-- 


c 2 (1)c 2 (-2) . 




^7I"Z^ 27T7 


where we have used c 2 (l) = e" and c 2 (—2) = e 


l 2 

— 27 Tl 


■e~ m = 1 , 


(2.81) 

□ 
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Exercise 2.6.15. Prove c 2 ( 1) = e ni and c 2 (—2) = e 2m . 


2.6.2 Determination of and 6 

We use the results from §2.6.1 to study &n and 6, which from (2.65) and (2.66) are 


log B N oo 

&N = Pq, & = Y^Pq. (2.82) 

q =i 5=1 

We show that |6 — S(A r ) is small by first determining 6 (Theorem 2.6.18) and then estimating the 
difference (Theorem 2.6.20). 

Exercise 2.6.16. Let h q be any multiplicative sequence (with whatever growth conditions are necessary 
to ensure the convergence of all sums below). Prove 

OO / OO \ 

YK = n J + Ev ■ (2- 83 ) 

5=1 p prime \ k= 1 / 

Determine what growth conditions ensure convergence. 

Definition 2.6.17 (Twin Prime Constant). 

t 2 = n 

p> 2 

is the twin prime constant. Using the Circle Method, Hardy and Littlewood were led to the conjecture that 
the number of twin primes at most x is given by 


1 - 


(p-l) 2 J 


.6601618158 


(2.84) 


Mx) = 2T 2 " +of " V (2.85) 

log X V log X ) 

The techniques of this chapter suffice to determine the contribution from the Major arcs to this problem 
as well; however, again the needed bounds on the Minor arcs are unknown. 

Theorem 2.6.18. 6 has a product representation and satisfies 


6 = 2 T 2 . 


( 2 . 86 ) 
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Proof. By exercise 2.6.16 we have 


6 = ^2 Pq 

q=l 


n (+ 22 Pp k 

prime \ k =1 

n (i+ pp ) 


(2.87) 


p prime 


because p p k = 0 for k > 2 and p prime by Lemma 2.6.12. The product is easily shown to converge (see 
exercise 2.6.19). By Lemmas 2.6.13 and 2.6.14, p 2 — 1 and p p = forp > 2 prime. Therefore 


e = 


n (i+ Pp) 

p 

(i+ P2) j^(i+ p P ) 

p> 2 

1 


p>2 L 

2 T 2 . 


1 - 


(P - 1) S 


( 2 . 88 ) 


□ 

We need to estimate |6 — &n\. As p q is multiplicative and zero if q = p k (k > 2 ), we need only look at 
sums of p p . As p p = — , it follows that the difference between 6 and 6 v tends to zero as N —> 00 . 

Exercise 2.6.19. Show the product in (2.87) converges. Hint: take the logarithm, and Taylor expand. 

Theorem 2.6.20. For any e > 0 and B, N such that log 5 N > 2, 

|S “ SkI <<C 0 (log<‘-W) ' 

Proof. 

OO 

& - & N = 22 Pi- 

<jr=log S N 


(2.89) 


(2.90) 
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1 


By Lemma 2.6.12, p pk = 0 for k > 2 and p prime. By Lemma 2.6.13, p p = — jzzpp if P > 2 is prime. By 
Lemma 2.6.10, p q is multiplicative. Therefore for any e > 0 


1 1 

\Pq\ < < 


0(g) 2 


Q 


2—2e 


(2.91) 


if g is not square-free this is immediate, and for g square-free we note 0(p) = p — 1 and 0(g) S> g 1 e . 
Hence 


e 


- ®at| < £ 


l 


.,2—2e 


= o 


<j=log B iV 


log (1 - 2e)B IV 


(2.92) 


□ 


Exercise 2.6.21. For q square-free, prove that for any e > 0, 0(g) g 1-e . 

Combining the results above, we have finally determined the contribution from the Major arcs: 
Theorem 2.6.22. Let D > 2B, C > 3D + 2 B, e > 0 and log B N > 2. Then 


t M 


N 

F N {x)e(—x)dx = &—+ O 


N 


+ 


N 


+ 


N 


log (1 - 2e) BN log D ~ 2B N log' 


C-3D-2B 


NT 


(2.93) 


where & is twice the twin prime constant T 2 . 


In the binary and ternary Goldbach problems, to see if N could be written as the sum of two or three 
primes, we evaluated the Singular Series at N (see §??). Thus, even after taking limits, we still evaluated 
the Singular Series at multiple points, as we were trying to see which integers can be written as a sum 
of two or three primes, and the answer told us how many ways this was possible. Here, we really have 
6(1); knowing how large this is tells us information about what percent of primes are Germain primes 
(see §2.7). As things stand, it does not make sense to evaluate this Singular Series at additional points. 
However, if we were interested in a more general problem, such as p, are both prime, b odd, this would 
lead to pi — 2 p 2 = b. We would replace e(—x) in (2.6) with c(—bx). Working in such generality would 
lead to a Singular Series depending on b. More generally, we could consider prime pairs of the form 
p, If we take a = c and b = 2 ck, we have the special case of prime pairs, and the Singular Series 
will depend on the factorization of 2k (see [HL3, HL4]). 


Exercise 2.6.23. Redo the calculations of this chapter for one of the problems described above or in §2.1. 
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2.7 Number of Germain Primes and Weighted Sums 

We now remove the log p,, weights in our counting function. By Theorem 2.6.22, we know the contribution 
from the Major arcs. If we assume the minor arcs contribute o(N) then we would have 


Y log p- log 


p<N 

p—1 

p, ^— prime 


^ = 0y + o{N) = T 2 N + o(N). 


(2.94) 


We can pass from this weighted sum to a count of the number of Germain prime pairs (^,p) with 
p < N. Again we follow [Na], Chapter 8; for more on weighted sums, see §??. Define 


7T G (iV) = 
G(N) = 


E 1 

Y log p ■ log — ■ 


p<N 

p—1 

p, — prime 


(2.95) 


p<N 

p—1 

p, £ — 2 — prime 


Theorem 2.7.1. 


G(N) 

l^N - * aW - 

P-1 


G(N) +Q / log log JV 


log N 


log AT 


(2.96) 


Proof. In (2.95), logplog < log N. Thus G(N) < log^ N ■ tt g (N), proving the first inequality in 
(2.96). 

The other inequality is more involved, and illustrates a common technique in analytic number theory. 
As there clearly are less Germain primes than primes, for any 5 > 0 


7T G (N 


1-S\ 


E 1 ^ = 


N 


1-6 


p<N L ~° 

p—1 

p , —^— prime 


log iV 1 


-s 


•C 


A 


1-6 


log A 


(2.97) 


We now obtain a good upper bound for 7 t g (A). Ifp > A , then 

p — 1 ( 1 

log — = logp + logll- — 


> (1 — 5) log A + O - 

\P 


= (l-S)\ogN + 0 


N i-s 


(2.98) 
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In the arguments below, the error from (2.98) is negligible and is smaller than the other errors we en¬ 
counter. We therefore suppress this error for convenience. Thus, up to lower order terms, 


p — 1 


G(N) > \ogp-log 2 

pyN 1 - 1 

p—1 

p, ^-~2 — prime 

= (1 — 5) 2 log 2 N J2 1 


p>N L - d 

p—1 

p, —^— prime 


= (1 - 5) 2 log 2 N (t t g (N) - ndN 1 - 6 )) 

> (1 - 5f log 2 N ■ tt g (N ) + O ( (1 - <5) 2 log 2 N 


N l ~ 5 \ 
log N ) 


Therefore 


log 2 N • tyq(N) < (l-5)- 2 -G(N) + 0\ [ log z N-—j 
0 < log 2 N ■ n G (N) - G(N) < [(1 - 6)~ 2 - 1] G(N) + O (log N ■ lY 1 ” 5 ) . 


N 1 ~ s \ 


If 0 < 6 < then (1 — 5) 2 — 1 -C 6. We thus have 


0 < log 2 lV-7r G (lV) -G(N) « N 


6 + 0 


/ log N \ 

V N* ) 


Choose 6 = 2 — . Then we get 


0 < log 2 N • v t g (N) - G(N) < 0[N 

which completes the proof. 

Combining our results of this section, we have proved 


r log log IV 
log AT 


(2.99) 


( 2 . 100 ) 


( 2 . 101 ) 


( 2 . 102 ) 

□ 


Theorem 2.7.2. Assuming there is no contribution to the main term from the Minor arcs, up to lower 
order terms we have 

MN) = i^' (2 - 103) 

where T 2 is the twin prime constant (see definition 2.6.17). 


56 



Remark 2.7.3 (Important). It is a common technique in analytic number theory to choose an auxiliary 
parameter such as 5. Note how crucial it was in the proof for 5 to depend (cdbeit very weakly) on N. 
Whenever one makes such approximations, it is good to get a feel for how much information is lost in the 
estimation. For 6 = 2 — we have 


jyl-<5 _ . jy-21oglogiV/logAf _ jy _ e -21oglogAf 


N 

log 2 N' 


(2.104) 


Hence there is little cost in ignoring the Germain primes less than N 1 5 . Our final answer is of size yprp- 
As N 1-5 = lo j) N and there are O N j primes less than N 1-5 , there are at most O N j Germain 
primes at most N 1_5 . 


Exercise 2.7.4. Let A (n) be the von Magnoldt function (see §??). Prove 


YMn) = l°gP + OY logx). (2.105) 

n< x 


As J2 p<x is of size x, there is negligible loss in ignoring prime powers. 


2.8 Exercises 


The following problems are known questions (either on the Circle Method, or needed results to prove 
some of these claims). 

Exercise 2.8.1. Prove Ve > 0, q l ~ e -C 4>(q ) -C q. 


Exercise 2.8.2. Let 

c,(N) 

Prove c q (N) is multiplicative. Further, show 



g 2niNa/q 


a =1 
(a,q) = l 


c,(N) 


p — 1 ifp\N 
— 1 otherwise. 


Exercise 2.8.3. Prove p(q)c q (N) / f(q) 3 is multiplicative. 
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Exercise 2.8.4. Using the above exercises and the methods of this chapter, calculate the contribution from 
the Major arcs to writing any integer N as the sum of three primes. Deduce for writing numbers as the 
sum of three primes that 


6 3 {N) 


pt <K <?) 3 

TT (, | \ ^ d'(p j )Cpi(N) \ 

\ l { u ^ J 

n i 1+ (p^wji) n 0 - p^-sp+s) ■ 


Note S 3 (N) = 0 if N is even; thus the Circle Method “knows” that Goldbach is hard. We call &(N) the 

Singular Series. 


Exercise 2.8.5. Let & 3 : q(N) be the first Q terms of 63 (iV). Bound 63 (N) — & 3 : q(N). Show for N odd 
there exist constants Ci, c 2 such that 0 < C\ < 63 (AT) < c 2 < 00 . 


Exercise 2.8.6. Assume every large integer is the sum of three primes. Prove every large even integer is 
the sum of two primes. Conversely, show if every large even integer is the sum of two primes, every large 
integer is the sum of three primes. 

Exercise 2.8.7 (Non-Trivial). Calculate the Singular Series 6 2 (iV) and 6 2 ,q(AT) for the Goldbach prob¬ 
lem (even numbers as the sum of two primes), and &w,k,s(N) and &w,k,s,C}(N) for Waring’s problem 
(writing numbers as the sum of s perfect k-powers). Warning: 6 2 (iV) — &2,q(N) cannot be shown to be 
small for all even N in the Goldbach problem. Do S 2 ,g(A r ) and 6 2 (A r ) vanish for N odd? 


2.9 Research Projects 

One can use the Circle Method to predict the number of primes (or prime tuples) with given properties, 
and then investigate these claims numerically; see, for example, [Law2, Sch, Weir] (for additional Circle 
Method investigations, see [Ci]). After counting the number of such primes (or prime tuples), the next 
natural question is to investigate the spacings between adjacent elements (see Chapter ??). 

Research Project 2.9.1. For many questions in number theory, the Cramer model (see §?? and Exercise 
2.1.2) leads to good heuristics and predictions; recently, however, [MS] have shown that this model is 
inconsistent with certain simple numerical investigations of primes, and in fact the Random Matrix Theory 
model of the zeros of the Riemann Zeta function and the Circle Method give a prediction which agrees 
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beautifully with experiments. There are many additional interesting sequences of primes to investigate and 
see which model is correct. Candidates include primes in arithmetic progression, twin primes, generalized 
twin primes (fix an integer k, look for primes such that p and p+2k are prime), prime tuples (fix integers k\ 
through k r such that p.p + 2k \,... ,p + 2 k r are all prime), Germain primes, and so on. A natural project 
is to investigate the statistics from [MS] for these other sequences of primes, using the Circle Method and 
the Cramer model to predict two answers, and then see which agrees with numerics. ADD REF to papers 
from Brent 

Research Project 2.9.2. In many successful applications of the Circle Method, good bounds are proved 
for the generating function on the Minor arcs. From these bounds it is then shown that the Minor arcs’ 
contribution is significantly smaller than that from the Major arcs. However, to prove that the Major arcs 
are the main term does not require one to obtain good cancellation at every point in the Minor arcs; all 
that is required is that the integral is small. 

For problems such as Goldbach’s conjecture or Germain primes, the needed estimates on the Minor 
arcs are conjectured to hold; by counting the number of solutions, we see that the integral over the Minor 
arcs is small (at least up to about 10 9 ). A good investigation is to numerically calculate the generating 
function at various points on the Minor arcs for several of these problems, and see how often large values 
are obtained. See [Law2] and REF TO CJM. Warning: calculations of this nature are very difficult. The 
Major arcs are defined as intervals of size 21og N — about rationals with denominators at most log B N. For 
example, if D = 10 than log D N > N until N is about 3.4 x 10 15 , and there will not be any Minor arcs! 
For N ~ 10 15 , there are too many primes to compute the generating function in a reasonable amount of 
time. Without resorting to supercomputers, one must assume that we may take B small for such numerical 
investigations. 
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